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ABSTBACT 

This report describes the r,aticnale and procedures 
for a series of studies related to reading tasks and reading texts. 
Children's reading tasks are exasined in five strands of 
investigation: ability to read for leaning, ability to read for 
doing, ability to read for resesbering, coaprehensioc difficulties in 
skilled readers, and classrooa observations. The text analyses 
consider six types of variables: orthcgraphlc-level variables, 
lexical-level variables, sentence- level variables, discourse-level 
variables, content-level variables, and other, extralinguistic 
variables* Overall, the report assuaes that reading is a process of 
iaposing an interpretation on a piece of textual aaterial and that 
coaprehension probleas arise vkenever an author's assuaptions about 
the knowledge, skills, prc7e£.slng capacity, and expectations of the 
reader are unfulfilled. (AA) 
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Introduction 

Too often educators and psychologists draw conclusions about research 
on an educationally relevant problem without adequate analysis of the 
nature of the problem itself. We feel that it is important to begin 
our study of reading comprehension with a detailed account of the tasks 
the young reader faces. We feel it is important here to analyze both the 
details of the tasks themselves" and the nature of the additional social, 
communicative, and related demands imposed by the classroom situation. 
Moreover we suspect that there are general teachable strategies that young 
children normally learn by trial and error which greatly facilitate his 
performance on these tasks. We therefore propose a study of these strate- 
gies with Sn eye toward making them an explicit part of reading education. 

A central part of the tasks under ^.naly^^s is, of course, the textual 
material with which the child is required to interact. Although an adequate 
analysis of a text cannot be accomplished outside of the context in which 
the reader interacts with the text, it is nevertheless useful to look 
very carefully at the nature of the particular texts children encounter 
at different stages of their educational development. At first the texts 
are primarily designed as vehicles for teaching the child decoding skills. 
By the fourth grade, however, the texts increasingly are intended to serve 
as vehicles for teaching content. Authors of these texts often attempt 
to control various aspects of the texts depending on the reason for which 
the texts have been devised. Texts devised for beginning readers in one 
way or another pay more attention to such things as orthographic structure 
and word frequency (see report on skills hierarchies). As the attitudes 
of the authors of textbooks vary, aspects of the text not under specific 



control by the author often undergo greater variation. Moreover, there 
are aspects of a text (such as overall structural characteristics) which 
are rarely carefully controlled. It is important for the researcher, as 
well as those involved in curriculum development, to understand both what 
the important characteristics of currently employed texts are and 
the effects these characteristics might hive on their comprehensibility 
for children at various stages of their development and with various cul- 
tural experiences. The accomplishment of this task requires: (l) that 
we develop methods of analyzing textual material which are applicable 

at all levels of analysis (orthographic, lexical, snytactic, semantic, 

aspects of textual 

pragmatic and general contextual/processing) which can' serve to analyze all 
relevant genres of texts (instructions, stories, essays) and which can be 
used as a basis for the analysis of these texts for readers at all grade 
levels K-8; (2) that we investigate the effects on comprehension of these 
various aspects of the text on readers at different levels of reading ability, 
with different interests and with different socio-economic backgrounds. 
These are clearly long term tasks. It is doubtful whether adequate answers 
to these questions can be developed even within several years. Yet we 
believe that a frontal attack on them is essential to the mission of 
the Reading Center. We are, of course, not beginning in a vacuum. Fortun- 
ately, recenu trends on textual analysis in education, psychology, linguistics, 
and artificial intelligence, all work in our favor and will contribute 
substantially to our efforts. 

In what follows, we proceed by first laying out our current research 
plan for the analysis of reading- relevant tasks. In this section we give 
special attention to the various reasons for which children read—reading 



for meaning, for doing, or for remembering/ We look at the strategies 
children use and can be taught to use in the abstraction of main idea3 from 
their texts. We propose to study tho classroom — ^lov; do t-ertcht,r^ and children 
really interact in the teaching of reading skills^ We propose tn look at 
the kinds of infer^-nces that are demanded cf children when they read 
texts designed for them, at their ability to make these inferences, and at 
what they do when they cannot make therri. We propose to look at the reading 
strategies employed by skilled readers when trying to construct interpre- 
tations of difficult to understand materials. Perhaps such strategies car 
be taught directly as part of reading education. We then turn to a dis- 
cussion of our current thoughts on text analysis. 

Texts, of course, can differ from one another on many dimensions. An 
adequate theory of textual processing must account for the ways textual 
and situational characteristics at tnese different levels can interact to 
give a skilled reader a coherent interpretation of the text. We must 
determine just what aspects of the text readers at various levels can 
attend to. In the section on textual analysis we describe existing and 
proposed research into tne textual characteristics relevant to the ortho- 
graphic, lexical, sentential, paragraph, and discourse level-. At each of the 
levels our research rlans are to focus on cnaracteristics of existing texts 
and how these characteristics interact with readers v^i-^h different knov/iedge 
bases reading for different purposes. 
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Introduction 

The goal of developing task analysis procedures is to make in-depth 
analyses of different aspects of reading in order to pinpoint where 
problems in comprehension arise during the transition period. Such an 
analysis can tell us both what reading skills should be taught and how to 
diagnose where children's difficulties arise. This research should look 
at children and adults performing typical tasks, using a variety of 
techniques to analyze their capabilities and their underlying processing. 

A jnain focus of this section is on how one goes about studying reading 
comprehension and study skills in the less-mature^reader . The main problem 
is one of externalizing an internal mental event ; the younger the subject 
the less likely it is that direct questioning will result in any worthwhile 
information. Not only is the younger child less able to express himself, 
but he is also less aware of his Ovm cognitive processes and less familiar -j 
with the self- interrogation techniques necessary to achieve the degree 
•of introspection needed for analyses such as those possible with adult 
readers. Thus it is important to develop methods of eliciting evidence 
of the child's self-knowledge, or lack of it, other than by means of direct 
questioning. 

As a second general point, it should be noted that we know very little 

o 

about the skills possessed by mature readers, let alone about the develop- 
mental sequence in which they are acquired. The development of a descriptive 
theory of the skilled reader's strategic processes is ^/^erequisite for 
informed developmental research. Only when we understand basic common- 
alities in the reading strategies of the efficient can we look intelligently 
for problems which can be the subject of remediation. 



It is not difficult co predict that we will find that older children 
are more strategic than younger children, and know more about reading 
effectively. It will also fail to amaze us if we find disadvantaged 
children particularly ignorant in this field. What is necessary is insight 
(based on our insights of mature processing), of particularly prevalent 
pockets of inefficiency .which are trans-situational, i. a strategy 
deficit which could effect performance on a wide variety of reaching tasks. 

» 

The identification of such deficits and the planning of suitable training 
programs is, of course, the aim of instructional psychology. In order to 
concentrate efforts on high pay-off areas, we need the information concerning 
what are the Essential strategies the adult possesses. 

Another general point whicTi should be considered in developing task 
analysis procedures is a taxonomy of reading purposes. A taxonomy suggested 
by Sticht (personal communication) is increasingly useful to us in thinking 
of what to look at in children. 

(a) Reading for meani/fig - reading in sufficient depth that one 
can extract the gist, with no intent to meet future tests; 
pleasure reading would fall into this category. 

(b) Reading for doing - reading instructions, signs, billboards, 
bus timetables, recipts, driving instructions, etc. 

(c) Reading for remembering - deliberate studying in anticipation 
of a future test.' 

These different tasks demand different levels of processing, different 
skills and strategies, and different levels of awareness. For these reasons, 
they should be kept somewhat separate. In'addition, different subject 
populations need' varying levels of skiUs on tne three. For example, educ- 
able retarded children, and many others who do not, or can not, enter 

U 



acadamic pursuits, do not nead to be efficient in studying. They rarely 
have to face examinations in their job situation. Training aimed at this 
population might be more profitable if it were focused more on reading for 
doing. This is the kind of reading that is required in everyday life 
situations, - one must read signs and notices, and job manuals and 
recipes, etc. Other types of r^^ading are a luxury. In the "Cloak of 
Competence" (Edgerton's, 1967 description of retarded invidiuals living 
"normal lives" in the community), only two of the 110 members of the cohort 
were found to read for pleasure (novels, etc.,, with the exception of illus- 
trated papers like comics and Daily News, Most of the cohorts, on the other 
hand, experienced reading problems of the reading for doing variety, and 
were troubled by them. They neelted to read advertisements of grocery 
specials, to follow instructions on assembling products. Intervention and 
training programs aimed at instruction- following seem particularly nacessary 
for such a population and would certainly have life-improving consequences 
if successful. 

With respect to these different types of reading, there are a number 
of questions that should be analyzed with task analysis procedures: 

1. What comprehension skills do skilled readers acquire for dealing - 
with different reading tasks? 

2. Which of these skills do different children acquire or fail to 
acquire? 

3. When do different reading skills develop from kindergarten to 
adulthood? ' 

4. How wt 11 does what is taught in schools teach the various skills 
necessary for skilled reading? 
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5. As reading tasks change, how do the skills required change? 

6. How well can the various skills that are needed for skilled reading 
be taught directly? 

Becaui>e these different questions require different techniques, this 
section proposes a number of different proceduiges for performing task 
analysis. Such di:versity permits the selection of the most su.ccessful tech- 
niques, and ensures that the task analyses will converge on the most 
important problems that arise in reading comprehension. The section describes 
d number of different task analysis procedures that are currently being 
piloted or considered as potential techniques t: at should be applied. 
Children's Ability to Read for Meaning 

A fundamental skill for efficient reading in school is the ability 
to abstract the gist, or main ideas, from a text. Our work on this topic with 
children is in the preliminary stages and has largely centered on methods 
of obtaining measureable indices of the child's knowledge. To date we 
have completed initial studies with normal children of from kindergarten 
to eighth grade, and retarded ch*^ Iren of approximately two years delay for 
each age. In general, we know that young children are better able to pick 
out the main idea of picture sequences than of stories presented verbally. 
They are also more likely than older children to be misled by red herrings, 
i.e., an irrelevant part of the picture made physically larger, or an 
irrelevant part of the story repeated or emphasized. 

In examining children's abilit- to si ;dy differentially the main ideas, 
we have attempted to obtain scorable data by having the child underline 
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key words jr sentences, underline or circle key words or write notes. To 
summarise v?e find: 

(a) Children below seventh grade, and junior high school educable 
retarded children do not tend to introduce such activities of 
their own volition. 

(b) Even when directly instructed to do so, they do not use the 
technique^i intelligently. For example, children who underline 
or circle words spontaneously tend to underline or circle words 
or idea units previously rated by adults to be key or important 
elements. Children who do not use the techniq es spontaneously, 
but only when instructed, tend not to underline key words but 
appear to'uehave randomly; or, they circle or underline long 

or unfamiliar words. We even have examples of children told 
to underline important points who underline everything! If they 
are told to underline, they will, but not strategically as 
spontaneous users tend to do. 
These studies are in their infancy, pilot data is not yet completed and 
technique refinement is still underway. Of importance is that these data 
are readily ocorable. For each child we have a marked study sheet. We 
can compare the underlined or circled words against adults' scoring of key 
or important elements, as in the Brown and Smiley (1977) study. 

Again tentritively it ^looks like there will be a hierarchy of these 
^skills, circling key words is easier than underlining key idea units, and 
note taking is particularly late in developing. 

We plaii to systematically observe the development of note-taking 
competence in the junior high and high school years. Sp^^cif ically, we wixi 
be looking for: 

(a) spontaneous use - when do children first start to take no^es 
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(b) quality of notes - this can be evaluated by having adults 
rate the clarity, organization, and representative nature of 
the child's productions. 

(c) use of note-taking - more difficult to assess - does the child 
take notes to solidify and organize what he is reading, or to 
provide a record which ' consulted, or both. Does note- 
taking help performance, etc. 

Several aspects of story understanding are important in our attempts 
to investigate children's abilities to read for meaning. We will discussj^ 
story recall data as a means of quantifying this ability, and an analysis 
of story understanding as a means for investigating higher level aspects 
of reading for meaning. 

In recounting stories it seems that even young children favoi the 
main ideas. However, they tend to recount simple action sequences, and" 
r»arely mention internal states (Mand] ^r and Johnson, 1977) or moods. 

Although . children ren'^'^mber the main message even in the absence of 
a deliberate attempt to concentrate on essential ideas, this does not exclude 
the possit)ility that deliberate strategies could be used to enhance com- 
prehension and recall of prose materials; indeed, it would seem unreasonable 
to suppose that the mature individual would net possess a rich variety of 
such skills. We suspect that the mature learner's awareness of his skills 
and of the task at hand would be particularly useful if longer preparatory 
periods were permitted for the study of material which must be maintained 
over some period of time, a more typical school learning situation. For 
example, in the Brown and Smiley (1977) paradigm, we would predict that 
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older children would use their knowledge of the importance of certain 
pieces of text to study efficiently, thereby enhancing their recall of 
important material. Younger children, not possessing the necessary 
"meta-comprehension" skills, would not benefit as much from the increased 

study timt- to focu;^^. on the essential. Thus their recall would Improve little 
if at all. We propose to investigate this hypothesis. 

The situation changes, however, if one trains children to use self- 
testing mechanisms for recall readiness. We have been working on this 
for some time. The basic idea is that the child is told to study material 
in anticipation of a tost. He is to indicate his readiness for that test. 
He has unlimited t .me . The measures of effective recall readiness are: 
time taken r-efore test, amount recalled at test, and observed activities 
during the stud> : erior . Thi^. is a very simple paradigm and can be used 
for simple list learning rote recM situations (Brown Barclay, 1976) 
and learning fr^om prc.o. F^von retarded children can readily be trained 
to wait before a tejt while they undertake some rudimentary self- 
monitorinp, procedures - and then they d_o benefit from the delay. Training 
on a lis^ recall tas^ poneralizes to prose recall (Brown 8 Camp lone, 1^77). 
We intend ro ^~'xtcnd this paradigm to more realistic study situations, and 
observf"^ and in tf^rrop.ate children ir ^he study period. 

If the ability to rate units oi a complex passage in terms of 
importance to th^ theme is a late developing skill, the implication for 
training study habits, r^-^adinc; comprehension skills and even the design 
of text mat'^rlals are important. 'hildren who have difficulty deterrnining 
what are the Key points of a' passage, can hardly be expected to select them 
for intensi/e study. uf ih^^rf^^t In this context j , an experiment by 



12 

Hershberger and Terry (196^) concern^^d with text learning in eighth- 
grade students. One variation which improved performance was printing 
all core or essential contents in red and all non--essential material in 
black. That is, some form of specific training or cueing in identifying 
important features of a passage may be needed before efficient study 
behavior will form a consistent part of the child 's "repertoire of cognitive 
skills. The task of identifying (and even training) such skills, pai-ii- 
cularly in slow learning children, will be a major feature of our work in 
this area. Mature readers appear to engage in self- interrogation to 
nionitor whether thev are conprehenuing the important points in a text. We 
propose to (1) provide good questions for young readers to model the kinds 
of question? they should ask themselves, (2) withdraw the questions and 
instruct children to generate questions of the type modeled, and (3) det- 
ermine whether this can be made a self-sustaining study strategy, 
Another major aspect of readin?^ comprehension that should '>e 
studied with task analysis is children's ability to make inferences about 
the motivations and actions in a story, and about the author's intentions 
in writing a story. The technique we have developed involves questioning 
children about stories given them with respect to a formal process theory 
of reading comprehend* ion (Adams & Collins, in press; Rumelhart ^ Ortony 
1976). This technique is an extension of techniques developed for the 
analysis of human inference strategies in answering questions and human 
tutoring strategi-s (Collins, Warnock Fassafiume, 1^75). The goal of 
this work i.: to determine * what inferencp skills children develop at dif- 
f«^rent :-oin^ in th^^ irin^. i^^Ion r^-'riod, -^rri wh^^thor thf^y have dovoloju-d 
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the necessaT7 skills for comprehending the texts that are encountered 
during the transition period. 

For our initial investigation in this area we have collected four 
short stories that are quite difficult to understand in terms of the 
motivations of the actors and the intentions of the authors. The stories 
include three fables from Aesop and a story about Ben Franklin from the 
Education Testing Service (1^60) Cooperative English Test of reading 
comprehension. The'se stories have been rewritten to make the vocabulary 
and sentence structure comprehensible to kindergarten children, and thus 
to remove these variables as sources of difficulty in comprehension. 

The procedure we will use is to have some children read the stories. 
We will encourage them to read for understanding and to reread whatever 
they do not understand. When they think they understand the story, they 
will be asked to retell in their own words what happened, and what they thought 
the point of the story was. Then they will be asked a series of. specific 
questions about each story. The questions are of four types: 1) about 
the world knowledge necessary to understand the story, 2) about the 
information in the text, 3) about the inferences necessary to understand 
the various actions and motivations in the story, and ^) about the author's 
intentions in writing the text. (One of the stories, and the questions about 
the story are shown in Tables 1 and 2.) This kind of questioning often leads 
the reader to further interpretations of the story, but the questions are 
ordered to minimize such effects. Even if rhlldren do revise their inter- 
pretations, the comprehension failures isolated by systematic questioning 
are likely to help us pinpoint where cc^prehension difficulties arise in the 
reading process. 
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TABLE 1 
Stone Soup 
by Aesop 

A poor man came to a large house during a storm to beg for 
food. He was sent away with angry words, but he went back and 
asked, "May I at least dry my clothes by the fire, because I am 
wet from the rain?" The maid thought this would not cott anything, 
so she lei him come in. 

Inside he told the cook that if she would give him a pan, and 
let him fill it with water, 'he would make some stone soup. This 
was a new dish to the cook, so she agreed to let him make it. 

The man then got a stone from the road and put it in the pan. 
The cook gave him some salt^, peas, mint, and all the scraps of 
meat' that she could spare to throw in. 

Thus the poor man made a delicious stone soup and the cook 
said, *^Well done! You have made a wonderful soup out of practically 
» ^thing." 
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table: 2 

Questions on Stone Soup 

1. Can you tell me what happened in the story? 

2. What was th^ r.tory meant to 3hO/^/? 

3. Was there anything you didn't understand when you 'first read it? 

4. Why did the man suggest making ^tone soup? 

5. Why did the cook think the man had done something clever? 

6. Why did the soup taste so good? 

7. Why did th^ man ask to dry himself by the fire? 

8. Why did the maid send the man away at first? 

9. Why did the maid finally let the man come in? 

10. Why did the cook let the man make stone soup? 

11. What did the man come to the house for originally? 
'12. Did the stone make the soup taste good? 

13. Wha^ Qlv.e was in the soup? 

14. What do you think of the poor man? 

15. How do yoa make home made soup? 

16. Did the man trick the cook in any way? 

17. Will a stone change in any way if it is heated in water? 

18. Can a stone be eaten? 

19. Was the story meant to show that stones are good for making soup? 

20. Was the story meant to show that if you want something enough there 
is always a way to get it? 

21. Was the ^.tory meant to show that it is possible to make someth Lng 
good, out of bits and pieces? 
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This study illustrates one kind task analysis procedure that can 
be applied to a variety of reading tasks that children of different ages 
are given. The procedures described should be used for thpse tasks that 
the survey shows are representative of reading tasks children face in school. 
One kind of reading task where the inferential problems are often of simi^^a^ 
magnitude to reading fable? is that of reading instruction. Instructions \ 
are often underspecif ied , so that children are faced with the problem 
of figjring out what are possible meanings of the instruction, given the 
constraints the real world imposes on the task (e.g. assembly instructions 
are constrained by the nature of the components available). But systematic 
questioning is only one technique that can be used to study inferential 
difficulties. Other task analysis procedures (described in this section) 
'^houl''' also be i.s*-i to -^t'ldv rhe children's capabilities in perf ormint; 
^differ^^pt readinf^ ta^k:^. 

Children's /^bility to Reao for Doing 

Task analysis of how children read instructions should have very high 
payoff (a) because of its importance for everyday life, particularly among 
retarded children, and (b) because it is easier to externalize the child's 
comprehension failures on a doing task — if he does not understand hfe will 
not perform ^.o^r^ action correctly. Thus, the experimenter will be signalled 
that a comprehension failure has occurred and can inlt^vene, question the 
child, analyse the terror patterns, etc. 

good way lo examine monitoring, detection and recovery mechanisms, 
is to set up ^;ituations where thr child must read tiie stepL. in a set of 
instruct-ions and th-n , erf"orm a t i ;V . I'" ♦he ta cm h*^ physiuallv^ enarated 
from the instruc' ionc this will further -^erve to externalize the child's 
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sequence of actions. The set up would be as follows: The child is given 
a sequence of instructions such as those of a recipe. The recipe will be 
in a central location, e.g., in a recipe card holder. The child Tiust read 
the steps and then perform actions at a geographically-separated location, 
e.g., read at the desk and perform at a stove or cutting board. In a 
kitchen situation such separation is by no means atypical. The separation 
is important as the child must read and then move away to perform. Presumably, 
moving away from the instruction would indicate that he is ready to perform 
the next step. Checking back would also require physically returning to 
the instruction; again, this could be easily noted by the observer, and 
questions could be asked of the child concerning his understanding of each 
step and his checking back behavior. 

Recipes do seem particularly suited for the task, as they require 
checking and monitoring as well as some commonsense assumptions. Consider 
this exa^nple from Crock Cookery by Mike Roy. 

French Cnion Soup 

^ Ibfe. butter ' 1 1/2 soup cans water 

6 onions 6 slices French bread 

2 cans beef broth 3/U cup grated Parmesan 

cheese 

In a skillet, melt butter and cook onions until lightly 
browned. Put them into the crock cooker along with the 
remaining ingredients. Cook on low for 4-6 hours. At 
serving time, spoon soup into heat-proof cups and top with 
toasted bread. Sprinkle with cheese. If convenient, place 
cups under broiler until cheese bubbles. 

f 
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This book is being considered for our pilot studies on following 
recipes because the recipes are simple ar^ the words easy to understand. 
But note the commons ens e checking problem involved. If you follow the 
instructions in the exact sequence, without reading the recipe through 
first, you put the bread and cheese into the soup and cook four hours. 
Result: disgusting soggy bread in soup, and no crisp bread and cheese 
to top the soup with! The demand for commonsense inferences like this is 
rampant in cookbooks, and particularly in so-called simple ones, where 
only a few steps are mentioned. 

Assembling toys is another promising area we are looking at. The 
task seems to have the same properties and can be treated like the recipe- 
following one. With educable retarded adolescents, we feel it necessary to 
choose material which they spontaneously try to read. From considering 
Edgerton's ethological descriptions of real-life activities of retardates 
in the community, we learned that they a^e really interested in driver 
education manuals. Teachers at the high schools confirm that the children 
are extremely keen to ],earn to drive. Thus such manuals seem particularly 
suited for studies with these populations. 
Children's Ability to Read for Remembering 

In considering the way in which children read for the purpose of remem 
bering what they have read, it is clear that the ability to grasp the main 
ideas is important. We saw earlier that underlining and note-taking are 
useful strategies toward this end^ as is self interrogation with respect 
to recall readiness. Iti this section, we review othei' study strategies 
that children learn and employ, and we discuss ways in which ^ome of them 
can be trained. 
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We are currently engaged in an exploratory investigation of students' 
study habits, or how children think they go about studying. We intend to 
conduct a semi-structured interview. We are devising a series of sec 
questions that will be put to the students in the context of an information 
sf»ssion where they will be encouraged to talk freely about exactly what they 
do when faced with various form:^ of study assignments (time spent, reading 
strategy, study strategy). The interviewer will practice various probe 
questions to elicit more information if the students are reticent. 

The aim of the project is to obtain preliminary data concerning how 
children.,set about studying. We are particularly interested in comparison 
between the seventh-grade students and seniors who nay have acquired idio- 
syncratic study routines. The senior class responses will be compared with 
those obtained froin college students. 

On the basis of this information, we hope to develop a formal interview 
technique to use as a diagnostic tool with the students from other schools 
particularly those who are less successful at academic pursuits. This would 
be a first step in our, intended program for developing training procedures. 

Because we anticipate difficulty in obtaining adequate verbal reports 
and introspections from younger children, we hope to use a cross-age tutor- 
ing program as a vehicle for forcing the children to explicitly state what 
they believe to be an efficient study plan. Children will be enrolled in 
a program of tutoring ai . at children (a) close to their own age, (b) 
much younger. They will be asked to help teach these less efficient learners 
to study effectively. In devising these instructional plans they will 
have the advice of an experimenter-observer. By observing their questions 
and plans concerning what to teach, we hope to gain valuable insights into 



20 

their knowledge concerning "how to study." Adequate debriefing for 
poorly instructed trainees will be provided. 

A similar idea is the little teacher idea the aim is the same, 
to provide a "real-life" situation which might elicit from the child any 
information he has concerning reading strategies. The basic idea is that 
children will be divided into small groups to participate in an academic 
project, e.g., a science demonstration. One child will be selected as the 
instructor each session. It will be his job to read the material, provide 
a summary for the others, and provide illustrations and demonstrations for 
his confederates. In the prer ration he will be i^ided by the experiment-er- 
observer as in the cross-age tutoring project. Each team will be told 
that they will be tested on the material and they must try to be the best 
team (to win a prize, etc.). The child's knowledge or ignorance concerning 
the main ideas, emphasis, illustrations, etc., and how to study material 
should be made explicit by this procedure. 

, As an extension of previous work with video-tape models (Brown, 
Barclay & Jones, 197 7), we are constructing video-tapes of children per- 
forming various study activities, such as rereading, underlining, self- 
testing, note-taking. We will see if observing children will model their 
performance after the tapes. Of more importance we will use the observation 
period to elicit evidence of the child's own knowledge concerning reading 
strategies - e.g., ask the observer why the model is undertaking a certain 
activity, if it will help him, does the observer do that, etc. 

As an extension of Markman's (1977) work on metaccfniit ion , we 
will ask children to act as critics. Specifically they will bo asked to 
rate the adequacy of a set of Instructions we have written, e.g., a book 
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of recipes, or a manual on how to use a toy, or play a game. The instructions 
will be obviously incomplete, . or misleading as in the onion soup example. 
Children will be asked to help us rewrite faulty instructions ^so that 
children of their age can read them. Again when informing us of the 
inadequacies of our prose, they should be revealing their comprehension of 
reading, and instruct ion- following. 

Determination of Comprehension Difficulties in Skilled Readers 

Comprehension difficulties often arise in the transition period 
because children move from reading texts designed to teach basic reading 
skills to texts that serve some educational purpose, such as expository 
texts and instructions. The early texts are based on vocabulary and situations 
that are relatively familiar to children. But the latter kind of texts 
involve riew ideas, new vocabulary, etc. Thus, a new set of problems 
arises such as how to deal with failures to understand different words and 
phrases, how to revise misinterpretations, how to select the important 
points for whatever purpose i: at hand (e.g., doing the task if given 
instructions, or remembering later if given expository material). These 
require a set of skills that children have not needed before. To some 
extent the same problems arise in aural contexts, but the appropriate 
strategies for dealing with such problems are quite different in aural 
and reading contexts. Hence, children have not learned strategies for 
dealing with such problems, and for the most part, they are not taught 
them. In such situations the brighter children develop strategies for 
coping and the less bright children lose interest. 
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An important goal of task analysis therefore, should be to determine 
what strategies skilled readers have developed for d^^ing with such 
problems. Then further task analysis should identify which of those 
strategies children learn, when they learn them, which children learn 
them, and whether children can be taught them. 

We can illustrate the importance of this kind of task analysis in 
terms of our preliminary analysis of skilled reader's strategies for dealing 
with comprehension failures. Coping with school must depend heavily on 
being able to figure out how to make sense of texts when the meaning is 
not obvious to the reader because of insufficient knowledge. We have carried 
out a preliminary study in which adult readers were given (Jifficult exposi- 
tory texts to read and comment on. After reading each paragraph, they were 
asked to comment on the following issues: 

1. What sentences did you fail to understand completely? 

2. What was the nature of the failure? 

3. Did you have any hypotheses as to what the meaning might be? 

4. What sentences either confirmed or disconfirmed previous 
hypotheses? 

5. If you do something other than read on, what do you do? 

6. If you jump i^omewhere else, why? 

7. Any other comments on what you do as you read. 

There were several striking aspects of the response protocols collected 
so far: 1) many words and phrases were only partially understood, 2) the 
reader would often formulate hypotheses about possible meanings, 3) later 
sentences often provided confirming or d u, confirm Lng evidence about previous 
hypotheses, and the reader would sometimes jump backwards or forwards 
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in the text to figure out the meaning. The s^ibjects were mostly unaware 
of these strategic aspects of their reading beforehand; slowing down 
the process highlighted the complex strategies that they used. 

These preliminary sessions led to a tentative theory of how skilled 
readers deal with comprehension failures. 

!• If a reader reads something that he does not understand, 
he may decide to take some strategic action or store the 
failure in memory as a pending question, 

2. If he stores it as a pending question, he may formulate a 
possible meaning (usually one) which is stored as a 
tentative hypothesis. 

3. If he forms a pending question, he usually continues to read. 
If a triggering event occurs after forming the pending 
question (e.j. too many pending questions or repetition 

of the same pending question) the reader may take some 
strategic action. 
5. Possible strategic actions he may take are: 

a. reread some portion of the text to collect ore 
information that will either answer a pending 
question or form a tentative hypothesis related* 
to a pending question. 

b. jump ahead in the text to see^ if there are headings 
or paragraphs that refer to the pending question and 
which might answer the pending question. 



24 . 

) 

c. consult an outside source (e.g., dictionary, glossary, ^ 
encyclopedia, expert) to answer some pending question. 

d. make a written record of a pending question. 

e. reflect about information that the student has in memory 
related to the pending question. 

f. decide to suppress a pending question because the text 
st'-'ioture suggests it will be answered later. 

g. quit reading the text. 

6. If the strategic action is successful, the reader usually 
continues reading from the point he last encountered the 
comprehension failure. 

7. If the strategic action is not successful, the reader 
usually takes some other strategic action. 

the research questicJns raised by the model of study depicted in the 
previous section include the following: text characteristics affecting the 
reader's ability to form a pending question, the nature of pending questions, 
events which act to trigger strategic actions, types of strategic actions, and 
the decision process for selecting a type of strategic action. 

Text characteristics affecting the reader's abUxty U form a pending 
question include the difficulty of vocabulary and logical structure of the 
text, necessary prerequisite information, and the depth of structure of 
Ihe information being read. It is important to know the extent to which the 
'nature of th*e text affects the student's ability to formulate pending questions 
since it will ultimately affect the studen"-'s ability to do something when 
he does not undei *:and what he is reading. 
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A more basic research question, however, concerns the nature of 
'pending questions. That i , wriat ic t hf- form in which skilled readers 
go about keeping tabs on their understanding? The fact that we call it * 
a ^^pendin?, qu^^stion^', implies tha*. it :s indeed a question, but 
this might not in fact be so. The reader may ask and observe his ability 
to answer questions about the text, but he may also observe his ability 
to fill In missing information and details, to paraphrase the text, to 
apply concepts and principles being discussed, to produce mental images 
of the text content, or to follow the logical structvj^e of arguments in 
the text. * 

Given that the pending^ "[uestion Itself is not al 'ays the stimulus 
for taking a stra^ei^ic action to correct misunderstandings, what are other 
-stimuli which cause strategic action to be tSken? Likely events which 
might act as triggering events includ2 an overflow in the pending file (i.e., 
too many pending questions), the repet. tion of a pending question, the 
disconf irmation of a previously formed hypothsis (either a tentative or a 
highly confident hypothesis, although each might trigger different 
strategic actions), formation of a pending question which the text treats 
as **given" information (i.e., treating a math proof as "obvious" when the 
reader does not see it as obvious), encountering a ponding question which 
the reader suspects is ery important, or encountering new vocabulary words 
easily explained through recourse to a dictionary. 

The next logical set of research questions concerns the types of strategic 
action which can be taken. Those indicated by the model described include 
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IcxDking back or forward in the text, looking to an outside source 
(e.g., another text or a teacher), thinking and reflecting about the 
problem or confusion, formation of a tentative hypothesis about "what 
the text is trying to say", doing some sort of record keeping such as 
•writing a tiote about the confusion, or quitting the study process for a 
while. We know that students engage in all of these activities on 
occasior, but which are the most appropriate for which kinds of under- 
standing difficulties? 

The last research question following from the model is the process 
by which the reader decides on (selects) a type of action. The major 
question here is whether there is ««ny relation between the type of compre- 
hension difficulty and the type of action. Another question is whether 
there is a default action which usually is resorted to if another is not 
obvious. For ^stance rereading may be a common and/or sensible default 
strategic action if no others come to rrind to the reader. If there is a 
relation between type of comprehension failure and type of strategic 
action, what is that relation? If there is none, what IS the selection of 
a strategic action related to? 

If we can formulate a precise theory of how skilled readers deal 
with comprehension failures, then it will be possible to design experiments 
and perform task analyses to study how well children develop such skills. 
Further it would be possible to teach these strategies and study how well 
children acquire and maintain them. This same research approach can be 
appli3d to various aspects of skilled reading: picking out main points, 
revising mi - Intr-rpretation-^ , reviewing tnxt:, road earlier, and so forth. 
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The goal is to formulate process theories of the important skills adult 
readers have unconsciously acquired to cope with the reading demands 
made on them. 

This proposed research involves a somewhat different use of task 
analysis than determining how children perform in various reading tasks 
(e.g. , reading instruction or stoi'ies) they are given in the transition 
period. Instead, this kind of analysis, some of which is conducted under 
computer control, investigates how readers deal with specific aspects of 
reading, such as comprehension failures, that arise in a variety of reading 
tasks. Both types of analysis saem critical to understanding the compre- 
hension difficulties that arise during the transition period. 
Observation in the Classroom 

Preliminary observation in. an urban classroom indicated that there 
are many ways that the various reading tasks and exercises given children 
fail to accomplish the goals intended by curriculum designers. For example, 
the following kinds of problems arose when children -were vforking 
exercises designed to teach vocabulary skills: 1) items were inappropriate 
to t|ie level of skill, or the background of the child, 2) tasks were of 
doubtful value to teaching vocabulary skills, 3) the children did not 
understand what to do, U) the teachers did not know what kinds of problems 
children were having. Analyzing where such failures occur is critical to 
understanding why difficulties aris^ in reading comprehension during the 
transition period. ' • * 



In order no analyze where, difficulties arise, sysjtemcitic obser- 
vation should be carried out in classrooms covering the span from kindergarten 
to eighth grade. The kinds of questions such task analysis should address 
are the following: 

1. If done properly, would the task actually teach the reading 
skills intended? 

2. Does the teacher under*=^tand the nature and purpose of the task? 

3. Does the teacher communicate the nature and purpose of the- task? 
Do the students ^anderstand the nature ar i purpose of the task? 

5. Are the students motivated to do the task? 

f 

6. Do the students have the background and skills to do the task? 
^ . Do the students have the time and resources to do the task? 

b. Do the students actually do the tasks? 

9. Does the teacher know how well the students are doing and what 
difficulties they are having? 

10. Does the teacher give help or feedback to the students, and is 
it important that s/he do so? 

11. How important are the skills taught this way in the overall 
process of reading? 

The task analysis procedures to be used should include tape recording 
and annotating events that occur in the classroom, and conducting interviews 
with ceachers and students. In particular, the observer should record the 
teacher's instructions to the students and annotate whit the individua] student 
does (with respect to the above questions). Any further interactions between 
the student and teacher should also be recorded. A portion of tho students 
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observed should be questioned while they are performing the task (in as 
\jiiobti?usive and non-leading a manner az possible) to determine the reasons 
for whatever difficulties they may have. All students shojld be questioned 
subsequently about any difficulties they had, and what they thought they were 
supposed to do and why. Finally, teachers should be interviewed after class 
as to what they thought was the purpose of the task, how effective they 
thought it was, and how well they perceived i^hat the students were doing 
and the problems they were having. Observations of this kind can only 
be carried out with a -few students in a few classrooms, but it should be 
applied to every kind of reading related activity in the classroom. 

Tnese in-depth analyses of reading in the classroom are quite different 
from the sampling of task contexts in the classroom (Goetz & Osborn, 1977). 
The attempt here is to evaluate when and why comprehension dif- 
ficulties arise in the classroom with respect to a theoretical analysis 
of the reading process. In this way we hope to discover which techniques 
now being used are most effective for developing student's comprehension 
skills and which are least effective. By comparing what is done in the 
classroom to a theoretical analysis of reading, it suggests what new 
strategies for teaching reading comprehension should be attempted. Finally, 
by analyzing how what is done is failing, it may be possible to make recom- 
mendations that would render reading curricula more failure-proof. 
Conclusions 

We have outlinoi^. a rather subs tan cic*'' body of work to be carried out 
in the analysis of r.one of the actual tasks children must carry out and a 
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discussion of some of the strategies involved in doing so as children 
become mature readers. Of course, even this body of work will not give us 
a complete picture of the task demands imposed on beginning readers- We 
have, for example, not given enough attention to the interaction of 
these task demands with cultural differences and other individual 
differences in knowledge and attitudes toward reading. We have, however,' 
laid out a program of research which will bring us a step closer to under- 
standing reading "from the crvild's point of view." Such an understanding 
will not only allow us to pinpoint problems of poorer readers and successful 
strategies of good readers, but will also provide information for the 
ultimate modification of the task demands themselves bringing them more in 
line with the theoretical accounts of the reading process we are develop- 
ing elsewhere. 
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Introduction 

Clearly the nature of the texts that children encounter plays an 
important role in their ability to read and understand those texts. The 
problem on which we focus here is that of isolating and investigating the 
role and relative importance the relevant properties of terxts possess, 
particularly as they pertain to reading. ^ We are especially interested in 
those properties as they contribute to tl^e "difficulty" of reading a 
text. Ad understanding of the role and importance of such properties 
can make it possible to establish reliable criteria for assessing the 
appropriateness of reading materials for various individuals, levels of 
development and for various tasks. This would allow, for example, the 
matching of text characteristics to the skills, knowledge, and interests 
of a particular student. It would also contribu e to the design of textual 
materials both for testing levels of reading ability, and for teaching 
relevant reading skills. 

Essential in this respect is th^ devlopment of adequate theoretical 
models to serve as a basis for our analysis and to support the development 
of relevant descriptive techniques for- classifying , evaluating, and coding 
texts. The development of such theoretical and descriptive tools must be 
our first order of bu:;iness, for without these we can neither adequately 
describe the characteristics of current textual materials nor can we 5tudy 
how these variables interact with a child »s state of knowledge, level of 
skills, and purpose in reading the text. 

A central tenet our theoretical perspective is that textual processing 
involves the creation of meaning by the reader based on the clues 
pro^rided by the author in the text. These clues come in many guises. 
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When an author constructs a text he chooses to make certain kinds of 
marks on the page because he believes that his projected audience will 
interpret those marks according to certain conventions. Some of these 
conventions are very explicit and a good deal of reading education is focused 
upon these. Thus, for example, the conventions that certain configurations 
of marks will constitute the letter "a", or that the sequence of marks is 
to be interpreted from left to right and from top to bottom are well 
.understood and knowingly adhered to. Other con /entions, ' such as that 
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certain configurations of letters correspond to certain patterns of sound 
are less reliable; ir some cases authors are careful -not to include too 
many words in which these conventions are violated. In other cases, however, 
words like "have", "would", "the", "love", etc., are used without realizing 
that .they form special cases which violate this convtation (although, of 
course, confonning to the conventional way of spelling these words). 

The general linguistic and communicative conventions that are employed 
which are only tacitly understood by the author constitute a different 
kind of example. Here, although an author may well be able to 
competently use these devices in an appropriate way, he may no-^ understand 
exactly what these devices are, and therefore may not appreciate the sophis- 
tication or cultural knowledge required for the appropriate use of them 
by the reader. For example, the proper comprehension of textual material 
invariably involves a rather large set of inferences that the author pre- 
supposes the reader Is able to make. These presuppositions are often made 
without conscious awareness and therefore without conscious monitoring by 
the author. Thus, although textual material devised to be simple may well 
contain little new vocabulary or only simple syntax, an author may 
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inadvertantly make his material very difficult to comprehend because of 
all he has ]eft unsaid. 

The overall difficulty of a text is a complex function of the ortho- 
graphic structure of the words employed, the difficulty of the vocabulary, 
the syntactic forms used at the sentential level, the organization of the 
sentences into higher level structures like paragraphs, the structural nature 
of the content of the text, the inferences necessary for comprehension, 
the role of illustrations, and so on. The role of each of these factors in 
a particular text obviously changes with the level of sophistication, 
knowledge, and motivation of the reader. It is also obvious that all 
of these levels interact in highly complex ways in the actual comprehension 
process, but for ease of exposition our proposed work will be broken down 
into the following somewhat arbitrary categories: of orthographic level, 
lexical level, sentential level, paragraph level\ story level, and discourse 
level. We will begin with but a brief discussion of orthographic and 
lexical aspects of text structure and move rather directly to a dis- 
cussion of the bulk of our proposed research on the sentential, paragraph, 
story, and discourse levels of analysis. 
Orthographic Level Variables 

Althourh orthographic complexity probably plays very little role in 
determining the overall difficulty of a text for mature readers, we presume 
that it is an important determinant of difficulty for readers who "sound 
out" new words in the course of processing a text. While we are generally 
not very concerned with ^'decoding" issues, this area cannot be ignored -- 
even^ for mature' reader:^.. For example;' for native speakers of English, 
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Russian names with unusual spelling patterns are clearly much more difficult 
than English names— even when both are entirely novel. Doubtless, ortho- 
graphic complexity plays a substantial] y greater role during the transition ^ 
period when children are steadily adding words to their reading vocabulary 
and mastering spelj.ing-to-so\ind correspondence rules. 

Within the literature on "readability" there have been many studies 
which have shown that various indicators of orthographic difficulty and 
lexical familiarity ,^'such as numbers of letters per word, number of syl- 
lables, and frequency, have a strong correlation with text difficulty 
(Borrauth, 1966; Klare, 197h -1975). In fact, in many of these studies these 
indicators were- found to provide the best single predictor of overall 
text difficulty. However, these indices of orthographic complexity are 
only crude engineering approaches to the issues of understanding what makes 
text difficult. Although no one has yet devised any superior alternative 
measures of orthographic complexity, we suspect that such variables as wo-^d 
frequency is an index of the probability that the .word in question is in 
the reading vocabulary of the child, whereas number of letters and number 
of syllables may well be cfrude indices of orthographic complexity. In any 
case, we feel that orthographic complexity is an important variable and we 
would like to determine how much of the difficulty associated with texts 
encountered during the transition stage can be accounted for by irregularity 
of the spelling-to-sound correspondence rules • Presumably as the reader 
becomes increasingly skilled, less and less of the difficulty will be 
determined by these factors and more and more by the ''higher-order" 
aspects of textual material. 
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A fuller discussion of a number of these issues appears in Schallert , 
Kleiman, and Rubin (1977). Suffice it to say that we recognize that all 
levels of complexity contribute to the overall difficulty of textual 
material and in the formulation of our research plans we must keep in 
mind the orthographic level as 3ll as the other more purely "textual" 
levels . 

Lexical Level Variables 

c* 

There is no clear notion what makes one lexical item more difficult 
than another. Historically, word frequency has been used as a measure of 
vocabulary difficulty. Thus, for example, Thorndike (1921a) discussed the 
issue and developed an early word count. The Teachers Word Book (1921b) 
to help educators do d better job of matching word difficulty to the 
knowledge of the stud^^nt. But why should a "word count" measure word 
difficulty? The assumpt ioTi._behind frequency as a variable is that the 
higher the frequency of the word the greater is the probability that it 
is in the reader's re-iding and/or listening vocabulary. There are a number 
of difficulties with this assumption. One Involves the fact that word 
counts are courits of surface lexical items and not meanings . Thus, for 
example, no distinction is made between tree in its usual meaning and when 
it is used as a technical term in transformational grammar--even though ^ 
there is presumably considerable difference in diff^iculty. Another problem 
involves the normative characteristics of such word counts. People in 
different cultural nilieux and with different interests have different 
linguistic experiences and thus different vocabularies. Thus, what may 
be frequent for ^'ne miy le rar^ for another. To what dee;ree do such 
iifferences account for differences in text difficulty for different' children 
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It is important to determine how much of reading difficulty can be attributed 
to such variables. No child, however good his decoding skills or his general 
comprehension skills, is going to be able to comprehend a text filled with 
an unfamiliar vocabulary. 

There are other problems with word frequency as a measure of 
word difficulty. The most basic is that difficulty depends not only on 
the number of times a child has encountered a word, but on its conceptual 
difficulty. Thus, for example, most children will find the word troglodyte 
unfamiliar and difficult. When told it means someone who live, in a 
cave, they should have little trouble learning the new word, since the 
underlying conceptual form is readily available. Oii the other hand, the word 
left in its political usage is quite different. Most children have encountered 
this surface form relatively frequently, but will have considerable trouble 
learning the new meaning, since the underlying concept is quite complex. 
While there has been little work on th j; specific problem, the recent work 
of Rosch on basic categories does begin to make contact with the issues ^Rcsch, 
Mervis, Gray, Johnson S Boyes-Braem, 1976), as does the classic work of 
Brown (195S). 

One other problem associated with the traditional approaches to voca- 
bulary difficulty is that it treats words in isolation, while the meaning 
that get assigned to words is clearly dependent on the discourse context 
in which the word occurs. Thus, for example, when we read of putting apples 
in containers we are more likely to understand container to mean "basket" whereas 
putting cola in containers is more likely to raise the image of bottles, 
(c.f. Anderson & Ortony, 1975). 



' Thus, even at the level of vocabulary much basic research n^ast be 
carried out before we can gain a true scientific understanding of "word 
difficulty/* V/e need to know how to represent the semantic and/or conceptual 
structure of words. We need to know how meanings are constructed. It 
should be noted that to i^esolve the problems of the comprehension of text 
by young children it will almost certainly be necessary to have" an under- 
standing of these issues--a mere engineering approach will not work. For 
ex Tiple, if one were tD select vocabulary on the basis of the reai. ability 
I , search i one might write text using words like' adz or discuss the concept 
of time in modern physics since the word time is of high frequency. Clearly 
a deeper un*der standing of the underlying issues is required beforo it will 
be possible to give well motivated advice about the difficulty 9f texts. 
Thus, we propose to evalucle the various models of lexical meaning that have 
been proposed (c.f . Schank, '1975, i^orman & Rumelhart, 1975, Miller & Johnson- 
Laird, 1976) and attempt to relate them to textual difficulty and to the 
more general discourse variables to be discussed below. 
Sentence Level Variables 

There has already been an eiiormous amount of research on the effects 
of syntactic struct .^res on comprehension; however, much remains" to be 
c'one in this area. 

Researchers in the readability area have shown that many surface 
syntactic features are related to text oiffi'culty. The most frequently 
studied of these surface measures has been sentence length, but various otner 
similar measures have also been inve- tigated — number of clauses, number of 
prepositional phrases, ratio of conjunctions to pronouns (Bormuth, 1966; 
Klare, 197U -1975). These variables tend to give correlations with text 
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difficulty in the .5 to .7 range; however, it is obvious that using these 
me..sures are not derived from a comprehensive theoretical analysis. It 
seems quite unlikely that replacing a sentence such as John likes chocolate 
ice cream and Bill likes vanilla ice cream with a shorter version such as 
John likes chocolate ice cream and B 



ill vanilla will ^ke f or >^s^er compre- 



hension. Onqe again, what is needed is an understanding of how and why 
particular- aspects of the 'sentence structure make for text difficulty. 

Another research tradition in the study of syntax has involved the 
study of the frequency of syntactic structures. The general approach 
has been to coun e fr»,quency of the patterns of children's spoken or 
written sentence*, build texts based on these counts, and then test the 
comprehension of the texts (Bormuth, 1368; Jongsma, 197U; Peltz, 1973-74; 
Ruddell, 1965; Smith, 1971; Tatham, 1970). Most of these studies have 
found the more frequent syntactic structures to be easier to comprehend 
•(Peltz, 1973-7i|; Ruddell, 1955; Smith 1^71-; Tatham, 1970). There haVe, 
however, been negative results as well (c.f. Jongsma, 19^14 

There are a number of serious difficulties with this "syntactic 
frequency" approach. First, the effect is not even clearly established. 
The studies that gave negative findings used multiple choice tests , while 
most of those that gave positive findings used the cloze techrique for 
measuring difficulty. Thus, the findings may not indicate that more fre- 
quent constructions are easier to comprehend, but rather that when filling 
in a blank children are more likely to use t-heir own forms of expression 
than they are to use the f:nns of the original text authors. Even if this 
problem is ignored, the ga->"hering of adequate data on frequency of syntactic 
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forms is a formidable undertaking. Most of the studies in this area have 
usedf the data of Strickland (1962), which is based or very crude sur- 
face analysis of syntactic patterns and a very limited sample. Solving 
these difficulties would require a subtle and practical procedure for analyz- 
ing the syntax of spoken utterances, something that is not easy to come by 
in the current stage in the development of linguistics. In addition, the 
fact that language is creative in the Chomskian sense means that even 
getting crude norms will require massive samples of language. 

Even though there are many difficulties with the syntactic frequency 
approach to comprehension difficulty, it does have some mild virtue as 
an applied solution. It seems likely, that after the appropriate research 
is carried at, that the linguistic fonAs used by children of a given age 
group will be easier than those U3ed infrequently. However, once again 
the relationship is likely to only be approximate, and so what is actually 
needed is serious scientific knowledge about which linguistic forms are 
more difficult and why. 

There has already been considerable work in experimental psychology 
on the difficulty of various syntactic structures. Much of the early work 
consisted of experiments designed to show that Chomsky's linguistic theories 
could be used as a psychological model (Miller, 195 5). 

The early work suffered from a variety of conceptual and methodological 
difficulties (cf. Greene, 1972 for a review) and is of little interest here. 
However, out of this tradition have come a number of studies that do attempt 
to give a detailed account of the difficulty of a limited set of syntactic 
forms . 



There have been a number of demonstration experiments which show that^ 
some linguistic forms are more difficulty than others (Coleman, 1964; 
Kintsch ^ Monk, 1972; Wang, 1970). 

Unfortunately, these studies do not contribute to our detailed knowledge 
of the causes of syntactic difficulty. They all show that intuitively 
more "complex" syntactic forms are more difficult to comprehend than are 
intuitively "simpler" forms. 

There Is a number of experiments dealing with difficulties of particular 
syntactic constructions. Thus, for example, as a rule, right-branching 
and center embedded sentences are rated harder to understand than left- 
branching sentences (Schwartz, Sparkman S Deese, 1970); center embedded sentences 
are easier wnen relative pronouns are employed (Hakes & Cairns, 1970); th&t 
complement constructions are more difficult with the "that" deleted (Hakes, 1972) 
sentences with negations are irore difficult than those without (c.f. Carpenter 
£ Just, 1975, Sherman, 1976) etc. 

All of these findi.igi, and many others are useful in as much as they 
give us a definite body of data ,on difficulties associated with particular 
forms. The problem, however, is that these results have been obtained in 
a rathv.-.-' piecemeal manner. What is required is a general theory of what 
kinds of syntactic structures arc more difficult and why. Bever (1970) 
has provided a perceptual processing hypothesis which accounts for many 
of these results. Yet even Bever 's principles do not always lead to a , 
cohe-^ent account of all of them. Perhaps the most promising account of 
the effects of syntactic complexity comes from Kaplan's (1972,197 4) attempts 
to apply the augmented transition network (ATN) as a serious model of 
sentencing processing, (see also Wanner S Marotsos , 197 4; Stevens & Rumelhart, 
1975) • It is our opinion ^ at the development of a general processing 
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model such as this should be explored further since it offers the most 
promise for devising a precise measurement 6f the syntactic, complexity 
oF textual material. Such a model could be automatically applied to a 
text by a computer so as to generate both a global measure of complexity 
and a local measure of the complexity of each senter^e and ^ach phrase of 
a sentence' (c. f. Kaplan, 197^ for^his discussion of transient processing 
load). - j' . * 

Our discussion so far has Reused on the syntactic structure of 
individual sentences. Of course, sentences do not occur in a Vacuum. 
Just as word meaning often cargnot be known witho^^ reference to the sentence 
in which the word, occurs, so too septence meaning often cannot be determined ' 
without reference to the general discourse in //nich it is embedded. In 
context, some of the syntactic effects mentioned above can be reversed. 
Thus, for example, although as a rule negatives are more difficult than 
positives Wason (1955) has shown that if the situation leads the hedrer^to 
expect a negative, negatives are easier than positives. Similarly, Olson 
& Filby (1972) showed that when passives are expected passives easier 
"♦"han actives, although the reverse is normally true. - 

A number of studies have shown that semantic factors in sentences can 
drastically modify the effects of syntactic form. For example, Slobin (1966) 
showed that children found nonreversible passives much easier to understand 
than reversible passives, e.g.. The .flowers are being watered by the girl 
war. easier than The cat is being chased by the dog . Stolz (1967) showed 
in a similar fashion that center-embedded sentences with semantic constraints 
'were much easier than center-embedded sentence without semantic constraint. 
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O/erall, it is clear that this type of experim/ental research must 
continue if we are to understand how syntactic complexitjj^af fects compre- 
hension* However, to be of maximum benefit to the overall issue of text 
compret^nsion there needs to be some shifts of emphasis. We will need to 
study a wider range of syntactic forms.- Better measures of comprehension 
must be developed. Both children and adults must be studied. More general 
models must?* be developed. The ^function of syntactic forms must be considered 
(why is a passive being used — for focus, for stylistic reasons, etc). The 
function of syntactic forms in discourse must be studied. This is clet.xy 
a fertile area of research and one qf much benefit to the eventual issues 
of text difficulty in the transition grades. 

Another quite different set of issues also arises at the sentence level 

due to the recent understanding that it is necessary to view sentences as 

much more than abstract syntactic structures. Austin (1962) caused a 

revolution in the analysis of language by pointing out that in order to 

understand actual linguistic utterances in communi'^-ation situations it is 

necessary to understand what speech acts the speaker is carrying out. For 

example the speaker might be asking a question, giving an order, or making 

i 

a statement. Austin proposed that utterances have an illocutionary force 
which conveys the intended speech act. 

One of the difficult problems that speech acts pose for a model of 
language comprehension is that there is typically a very wide variety of 
constructions that express the same basic intention. For example one can 
ask someone to close the door by saying: 
(1) Shut the door. 



(2) Can you shut the door? 
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( 3 ) I wish the door were shut . 
It's noisy in here * 
Clark and Lucy (1975) t>ave proposed an information processing account of 
the understanding of conversationally implied requests; They suggested 
that the hearer first obtains the literal interpretation of the utterance. 
Then the hearer checks to see if the interpretation is appropriate to the 
context. If the literal meaning is appropriate th^ it is taken as the 
intended meaning. If the literal interf)retation is inappropriate the 
intended meaning is inferred through a series of "rules of conversation" 
(Grice, 1975). 

The analysis of language in terms of speech acts raises a number of 
interesting questions which must be addressed in the context of textual 
analyses. Since a wide variety of linguistic forms can be used by a 
speaker/writer to express the same speech act, are some forms more difficult 
to comprehend than others? To some degree the use of indirect speech and 
certain other indirect forms of expressing speech acts is a culturally 
relative factor. Do some c\ idren hav^ special difficulty an understanding 
the implications of sentences like "Would you mind closing the door?" and 
if so, does this lead to any special difficulties in undevs'tanding the 
sorts of textual materials these children are expected to understand? 
Does the reader understand the intent of the author — to amuse, to 
persuade etc. ? All of these issues are new and have not been studied 
in previous readability approaches to text analysis. Do th'ese factors, 
in fact, play an important role in determining text difficulty? 

A final series of problems at the- sentence level are the various pre- 
suppositions and implications that must be understood before sentences are 
completely comprehended. Just and Clerk (1973) have studied the difficulty 



of the presuppositions artd implications that must be understood before 
sentences are completely comprehended. They assume that for a sentence 
such as John managed to find vhis hat there is presupposed John tried to 
find his hat , and the implication is that John found his haj: . Harris (197H) 
studied a variety of verb classes in complex-complement sentences and 
showed that adult subjects made the expected inferences. For example, in 
a rating task they indicated that the sentence John regretted that Bill 
was absent implies that the speaker believes Bill was absent , while the 
sentence Harry pretended that Bill was absent implies that Bill was not 
absent . In a second study Harris (1975) studied children's acquisition of 
this type of ui derstanding and found that some of the classes of verbs 
were not understood until quite late. Once again it is clear that the 
problems relating tc the presuppositions and implications of sentences 
is a complex and rich topic that must be studied before we can give an 
adequate account of text . 
Discourse Level Variables 

In our attempt to analyze textual materiel, we, of course, cannot stop 
at the level of sentences. -We have already pointed out that words must be 
understood in sentences in which they occur and that sentences must be 
understood in the discourse context in which they occur. Most of the 
work by psychologists and linguists over the last 20 years has focused on 
words and sentences; serious work on discourse factors in comprehension and 
text difficulty is in its' infancy. We belie-^e, however, that not only 
are discourse variahles essential to our understanding of comprehension of 
textual material, but that indeed these are the critical variables in thp 
analysis of textual material. The work on discourse ^^ariables can 
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conveniently be divided in two parts: the primary experimental work 
coming from psychology and the more formal work on discourse variables 
deriving from linguistics. We begin with a discussion of the psychological 
approaches • 

Psychological approaches > As we have already said, traditional research 
in the area of readability has used variables such as word length and 
frequency as the basic elements (Klare, 197U-1975). Consequently, it has 
failed to reveal any contribution that discourse level variables might make 
to text readability. That discourse level variables cannot be ignored is 
probably obvious once stated. 

Within experimental psychology there have only been a few studies 
directly relevant to the comprehension of discourse level phenornena and 
so it will be necessary to include in this section a number of studies ^ 
that used memory tasks which are only an indirect measure of comprehen- 
sion • 

Deictic aspects of language can play a considerable role in compre- 
hension. Brewer and Harris (1974) and Harris and Brewer (1973) have 
shown that if subjects are asked to remember sentences isolated from the 
actual time, place, person, and discourse context the deictic words are 
much more difficult to remember than the nondeictic words. In sentences 
with appropriate contexts the deictic words were little different from 
the nondeictic words. Thus, use of the d^eictic words away from their 
normal context seems to render them un interpret able and hard to remember • 
Much basic work on the comprehension of deictic elements remains to 
be done. For example, how does a hearer use knowledge about the non- 
linguistic aspects of the world to interpret deictic words - in sentences? 
• This may be an important problem in reading comprehension Ince the 



problem of deixis is somewhat more complex in written material than in 
spoken discourse. In written discourse the knowledge about time, place, 
person, and context must be explicitly presented or implied in the prose 
itself since the individual attempting to comprehend the material is not 
actually present and aware of the time, place, person, and context. 
Written discourse also makes frequent use of deixic to refer to aspects 
of the prose itself. Tlius , the writer refers to this argument or to the 
latter point. Everyone who has edited written material knows that the 
use of the vaguely specified this can h^ve a strong effect on the compre- 
hension of prose. These matters must be investigated with transition level 
readers and further discussion of them can be found in the report on 
Oral and Written Language. 

One of the crucial devices that holds text together is anaphoric 
references-^-aspects of one sentence referring back to someone or something 
in an earlier sentence. Lesgold (1972) showed in a probed recall task that 
sentences witti pronominal co-reference ( The aunt ate the pie and she was 
senile ) frequently functioned as a unit while sentences without co-reference 
( The aunt ate the pie and Alice was senile ) acted more like two separate 
units. 

deVilliers (197U) found a similar effect with determiners. He found 
that a text with definite articles in it (showing co-reference) was better 
recalled than the same text containing indefinite articles. Bormuth, 
Manning, Carr, and Pearson (1970) tested children's knowledge of anaphoric 
reference by a question asking technique. For example, the subject would 
hear a pair of sentence:, such as The boy fell off the steed . He fractured 
his arm . , and then would be asked Who fractured his arm ? Bormuth et al. 
tested a number of different forms and found a substantial range of 
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difficulty for children. Haviland and Clark (1974) looked at comprehension 
in terms of given and new information. They found that the second sentence 
of the pair Andrew was especially fond of beer . The beer was warm , was 
slower than the same sentence in the pair We got some beer out of the trunk . 
The beer wgs warm . Again the state of the area is such that we need more 
conceptual and empirical work before we can begin to understand the full 
complexities of the various anaphoric devices in text comprehension. 

Perhaps the most powerful discourse variable which has received atten- 
tion is the role of importance in comprehension. Clearly, importance is 
discourse level phenomenon. What is important in one context is unimpor- 
tant in ano1;her. It has long been known that more important information 
is betterrecalled than less important information (e.g., Bartlett, 1932; 
Binet & Henri, 1894; Gomulicki, 1956; Johnson, 1970). This recall advan- 
tage indicates that readers somehow extract, and given special prominance 
to those more important aspects of the text. Thus, if beginning readers 
are to become mature readers, they must learn those characteristics of 
texts which reflect importance. We, have proposed 3 series of studies on 
this general topic in the section of task analysis. 

One of the functions of a theory of content structure is to provide 
a theory of importance. Presumably, those aspects which are most important 
play salient roles in the structure of the content. Thus, accounts of 
discourse structure such as that by Rumelhart (1975, 1976), Schank 
Abelson (1975) etc. have focused on summarization as a key dependent 
variable in their research. The issues of content structure play such 
a large role in our proposed research that we have devolved an entire ^.ection 
to it. Thus, we put off further disc :>ion to that section. 
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Although the linguistic work on such discourse variables is limited, 
still it is more substantial than the psychological literature. It will 
be useful, at this point, to discuss this work before proceeding with 
3 discussion of effects of the global organization of textual content 
and other, more extrselinguistie variablesw 

Linguistic approach . An important and intriguing question both for 
research on comprehension and for the analysis of texts is to what extent, 
and in what ways, are purely linguistic devises used to provide clues , 
to the content structure of the text? Though the study of this question 
has only recently begun (at least in this country), there are clear indica- ^ 
tions from linguistic research that certain linguistic phenomena function 
more or less directly as signals of various aspects of text or discourse 
structure. 

There are four trends in* linguistic research relevant to this question. 
The oldest of these is the theory of "functional sentence perspective" 
(FSP), most closely associated with the Linguistic Circle of Prague, but 
including similar work by Halliday (though the terra FSP is not often applied 
to his work), and most recently work in the United States by Susumu Kuno 
that combines his view of FSP wixh certain aspects of transformational 
grammar. For a representative sampling of FSP, 3ee the various issues of 
Travaux Linguistique de Prague , the papers in Danes (197^), especially those 
by Firbas and Dane?; and Sgall (1975). Typical rela« d work by Halliday 
appears in Halliday (1970, i97U) and Hasan and Halliday (1976). Kvno's 
work can be seen in Kuno (1972, 1975, 1976). 

■J 
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Research in FSP goes back to the work of Mathesius and other linguists 
of the Prague School on the syntax of Slavic languages. Slavic languages 
generally have more highly developed case-marking systems and, probably as 
a result, considerably more freedo'n of word order than Engliih. But in 
studying variations in word order, Prague linguists noticed that word order.' 
is constrained by discourse factors. Elements expressing "known" or "old" 
information generally precede elements expressing "unknown" or "new" infor- 
mation. The primary difference in this respect between English and Slavic 
languages is that in English word order changes are brought about by 
rules that also change grammatical relations; Dassivization, for instance. 
Firbas (1964, 1955, 1971) has written in detail on the relation of English 
word order to information distribution. 

From these origins there has arisen an extensive literature on discourse 
notions like "topic", "focus", "comment", "given/new", "new/old information", 
and "communicative dynamism". Although there are differences among the 
various proponents of FSP, there is a common theme: close attention to 
language as a communication system (as opposed to the recent American bias 
toward description of language as an abstract formal system);' in particular, 
there is a growing interest in reflections in linguistic forms of discourse 
properties. Generally speaking, FSP linguists have a three- level approach: 
the level of semantic structure, the level of grammatical (roughly, syntactic) 
structure, and the level of the "organization of the utterance" (see Danes 
(1974) for discussion). It is the third level that is described in terms 
of discourse- functional properties and their reflection in graipmatical form, 
and it is this area that relates directly to the matter of the structuj^e of 
discourse, hence of texts. The interaction of such matters with questions 
of the structure of content needs close investigation. 

ERLC 
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We expect that many of the concepts of FSP theory, especially "topic", 
"new/old information" and the insighr that these matters at' least partially 
condition some syntactic properties, will be of great importance in under- 
standing the role of linguistic form in the interpretation of content 
structure. As we have seen, psychologists have already begun an investigation 
of the ."given/new" distinction and its role in comprehension (Havilahd £ 
Clark, 197^); 

The main problem with this approach (and all theories that deal with 
these matters) is that crupial , intuitively appealing notions like "topic", 
"given/new" and so forth, even though central to the story, are not ^ 
sufficiently clear and explicitly defined or described, so that it is not 
clear, how to apply them in any but the simplest cases. This makes it 
difficult to test the predictions. While we find Kuno's (1972) innovations . 
intuitively verv appealing, there are serious problems in evaluating 
them. The crucial notions "theme", "contrart", "exhaustive listing" 
and "neutral description" are not adequately defined, and are consequently 
difficult to tjsst. The immediate goal for relevant linguistic research 
in this area should be to develop more precise characterizations of 
central concepl^s , and some sort of replicable diagnostics for their 
occurrence. Some work along these lines has begun (see fcr example Chafe 
(1976), Keenan £ Schieffelin (1976), and Li S Thoir^scn (1976) for 
exploratory work). At the same time, the FSP framework should be 
compared as a model of discourse with other theories of text structure, 
such as those of Text Grammar, Grimes, Rumelhart, and so on. 

A second theory, more recent than FSP, i.o the theory of Text Grammar 
(TXT). This theory is concerned directly with as^^'=^'"ts of content structure. 
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Basf.d on the observation that many linguistic phenomena cannot be 
dealt with insightfully at the sentence level, proponents of TXT take 
the position that it is a mistake to take the sentence as the basic unit 
and domain of linguistic theory; rather, the text or discourre should 4)e 
basic. According to van Dijk (1972), discourse is the only justifiable - 
natural domain of an empirically adequate theory of language. The domain 

^entence-grammar is to be entirely subsumed in TXT, and no subpart of TXT 
will be isolatable in any natural way as a sentence grammar. Though TXT 
is still in a relatively embryonic state, more a program for resea^c!: 
than a full-fledged linguiGcic theory, it is possible to discern in it 
two main vari^iits: a formally-oriented variant, and a funct ionally-orieiited 
variant. 

Jhe former is a tendency to carry over from sentence to discourse domain 
the view of linguistic theory as a mechanism for generation of well-formed 
formal objects, independent of speaker, hearer, and communication. The goal 
of early TXT is to characterize "coherent text*^ generatively , by providing 
formal rules to generate the set of well- formed texts. The second variant 
of TXT is more functionally oriented. This work is even more programmatic 
than formally-oriented TXT. Its main proponents are Klein (19';2), Brirker 
(;^73), Schmidt (1973) , and Wunderlich ( 1970, 1971, 1972). Though this 
variant is also concerned with the structure of discourse, it io orientated 
less toward formal apparatus for the generation of text and more toward the 
construction of a model of communicative competence. Here linguistic units 
are not seen as isolated formal objects, but with respect to the socio- 
comraunicative context in v/hich they are used by s; eakers. Hence the star"»-ing 
point for functional theory and analysis of text grammar is linguistic 
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communication in Interac^tive situations. Rut even -^his approach has a 
rolo for t\\< analvi^is of (ii'>coups' ^tructurf, Sandi^, (1973), for example. 
arr,uos for a vl»?w nf ;»>xtr. .it. ^"^itlu.^ : oqut-ncos or hierarchies of speech 
acts, Rocent work by van Uijk (TJVf), 1976) seems to give a larger role to 
functional cons i vlorat iv;nf , < houi^h he is clearly still concerned with formal 
charactorizat ion ol w»>ll-tormed textj^. 

The promise of this work for the Analysis of the content of discourse 
will bo 'Uscussed in the tollowinf, section. It may provide useful descrip- 
f ions of th(^ relation betwf^»>n content structure and linguistic form, though 
TXT 'rk in this aT\-^a is not yet very specific. 

In this country, linp,uist-ic work relevant to text analysis is connected 
with two separate Lini^i^istic thtn^ries. The first is in work on discourse 
structure in the lamil'-^r fram'^woi'k oi ta^memic theory, associated primarily 
with Kerinerh Fike and Robert I.onp.acre , and their co-workers, (see Pike (1964), 
Lonr,actH^ (l^^GB, 1*^72)). In rhis framework the attempt is moide to apply the 
slot-f i Uc!^- tai^.rne^ine mode of syntactic cription to tne description of 
dir^.cour ac the level of sentence, paragraph, and discourse, \ielding 
a hierarchical analysis in terms^ of form and function. Attempts are made 
at clasc ifying. types of discourse ("narrative", "hortatory", "expository", 
etc,;- see Lon^.acr^e (IM7?), Chapter (f'i ve ) as a first step toward analyzing 
the form/ function relation in discouxje, and within this taxoiiOmy to 
relate discourse fanction to /-^ammatical form. This approach includes an 
abstract "d^ep structure" of the discourse, and rules relating this structuie 
to grammatical structure in terms of intersentential relations and gram-- 
matical devices, nak ing use also of case-like concepts like "agent", 
"instrument", etc. The Dulk of tagmemic work on discourse structure has 
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been pn languages of New Guinea and the Philippines, but the insights 
embodied in their analyses may be useful for the analysis of English texts. 
The examination of English discourse in contrast with other, languages may 
point up discourse devices unique to English. 

Work on the relation between discourse structure and linguistic form — 
syntactic properties in part icular--has been increased recently in generative 
grammar. It has become clear in this framework, as well as the ones discussed 
earlier, that rome itactic properties cannot be fully described without 
reference to discourse and content factors. This includes not only pro- 
cesses of prorominaii zat ion and ellipsis (Ross, 1969)^ Grinder S Postal, 1971); 
Morgan, I'^TS); Sag Kankamer, 1976 ), but also discourse 'Conditions on 
syntactic rules (Hooper * Thompson, 1973; Green, 1974, 1976; Morgan, 

1975; Gary, 1974; Ziv, 1976; and others). , This work indicates that 
syntactic properties can serve as clues to discourse structure at various 
levelL;. 

The most ambitious generative approach to discourse and syntax that 
grows out of this research Is contained in unpublished work by G. Lakoff ^ 
on the theory of "dual hierarchy granmar** . In this theory syntactic rela- 
tions are treated as predictable from a combination of discourse and semantic 
factors. This theory is yet in it:: etibryonic stages. 

Work Deing done in the generative framev/ork is by far the most explicit 
and detailed being done on matters of linguistic form. What it lacks is a 
coherent tho3ry of discourse on which to base its descriptions. 

One of tno firr^t rf^qui remonls for our theoretical research ' is to 
compare these appro< hcs and to answer such questions as: on what points 
do tney differ nuLstant LvoxV l)o they yield uc^^ful analytic procedure? 
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Hov can they be imporoved? An attempt must be made to bring together 

the detailtsd description of form of the generative work and the models 

of discourse of the others. For the time being, until a unified approach 

can be constructed, the most promising strategy is to continue detailed 

analysis using whatever rough-and-ready discourse description is wost 

r 

convenient; then once content analysis procedure (and content-analyzed 
texts) are available, work can begin on detailed study of the relations 
between content and linguistic form. 

In the meantime, the most pressing need is for more explicit theories 
of discourse. There are many anecdotally illustrated theories of discourse 
available, but in them there is widespread vagueness, inconsistency, and 
confusion on the exact nature of their central notions. (See Chafe (1976) 
for a brief but revealing discussion of the generally confused state of 
affairs,) Even such an intuitively simple notion as "contrast" has never 
been defined or modelled, and is likely to be surprisingly difficult. We 
dc not mean to say that none of the existing approaches are correct; only 
that it is very difficult to subject them to empirical test, and to "raw 
out their implications in detail, until they are made more explicit and rig 
orous. At this point what is needed is not so much grand theorizing as 
detailed ;:heoretical and empirical analysis, for th** sake of increasing the 
level of precision and rigor invqlved. This entails both an increase in 
conceptual precision and the development of explicit formalisnsto reflect 
that precision. The necessary research lies on the fading border between 
linguistics, cognitive psychology, and artificial intelligence, i.e., 
in "cognitive science". Realistic models of discourse will necessax^ily 
draw on all three, in that problems in the representation of knowledge, 
models of coinprehension, and linguistic semantics and pragmatics will all 
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have to be dealt with in the constructio jt discourse models of the nec- 
essary precision. 

The first step in this direction is to devise a program of research 
that inherently demands a high level of explicitness and precision, so that 
investigators will know iramediHtely when the model lacks clarity. One good 
strategy, besides the usual sort of theoretical research strategy, is the 
use of computer modelling techniques as a research tool. B'^ginning efforts 
will no doubt require much human intervention, but the consiiant goal should 
be the reduction of such human guidance. A good beginning would be to 
attempt a completely explicit formulation--computer modelling, if possible — 
of the simplest discourse notions, like "contrast" and "exhaustive listing". 
Given some degree of success with these, more difficult concepts like 
"topic" and "new infoxTnation" can be attacked, including attention to 
detailed questions like: Can there be more than one topic per sentence? 
If so, under what circumstances? Is it possible to have several topics 
simultaneously at different levels of discourse? If so, are they distinguished 
in any way in the linguistic system? How does the notion "topic" relate 
to syntactic phenomena of "topicalization"? What is the nature of the 
interaction, if any, betwe^^n *'topic" and "old information"? Are the two distinci: 
We fully expect that the number of questions like these that , arise will 
greatly exceed the answers arrived at ; s -ch a i 3sult will be both a sign 
of progress and an indication of the present state of igno^^ance. In this 
research those aspects of linguistic form should be a central factor that 
have been shown to be related to discourse structure, and new ones should 
be looked for, as they presumably will provide one of the best (and most 
easily controllable) teets of the appropriateness of the theoretical 
concepts being investigated. 
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Content Level Variables 

In addition to the various ways that text can be worded, structured 
or organized, it is also possible to study the structure of the content of\ 
the discourse itself. A body of work focusing on the logical and narrative 
structure of textual material has been accumulating over the past several 
years. In this section we sketch -the major boundaries of the work and 
indicate the major directions our own efforts in this regard will take. * 
. For the purpose of the analysis of text content, it is necessary to 
develop and test effective descriptive techniques for content analysis. There 
is a need for research on ,two fronts: first, further research on the role 
of content structure and inference in text comprehension with a view to 
.developing a model of comprehension; second, research on practical 
procedures for the descriptive analysis of cexts. 

Test structure and comprehension . The simplest and older,t notion of 
text structure is that the theme is the unifying element that runs through 
the next and binds the elements together. When the theme is vague and dif- 
ficult to discover, as in studies by Doolirg 5 Laciiman (1971), Braiisford & 
Johnson (1972), and Doolinj S Mullet (1973), subjects are unable to benefit 
from the theme relatedness of the passage, unl.^ss they are cued onto the 
theme with a title or p^'cture. Without such cuing, subjects find texts 
impossible to comprehend and difficult to remember. The manipulation of the 
theme by choosing the title given to an ambiguous passage can dramatically 
alter the interi,retat ion (Schaliert, 1976). 

Theme relatedness is a powerful determinant of recall. Subjects tend 
to remember the main theme of a ctory and to forget details or peripherally 
related events. Recall may therefore be characterized as an abstrac'»:ivo 
process. For example, Gomulicki (1955) investigated the immediate recall 
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of prose passages from 1^ to 200 words. He found that although subjects 
were able to recall the shorter passages verbatim, they were only able to 
recall the more important aspects of the longer passages. Further, he found 
that omissions were by far the most common error type, and concluded that 
memoryv for passages was most accurately described as abstractive process. 
In fact, when judges were given both recall protocols and deliberately 
written abstracts of the same passages, they were little better than 
chance at distinguishing between them. Theme relatedness determines the 
importance and memorability of information in discourse. Johnson (1970) 
measured importance of pausal units in discourse simply and directly by 
having subjects rate their importance. Rated importance was found to be 
a powerful determinant of recall. 

Importance has since been systematized by the development of text 
structures from which measures of importance could be derived. Meyer 6 
McConkie (1973) used quite a simple method of discourse structure 
analysis. They had graduate students outline a passage, and then on verted 
the outlines to tree structures. From these tree structures, three 
measures of the importance of an idea unit in the structure of the passage 
1/ere developed: a hierarchy depth score, which measured how high in 
the hierarchy the unit occurred; a unite beneath score, which ^measured the 
number of units which were beneath the given unit in the hierarchy; and • 
a combined hierarchy score, which combined the two above measures, equally 
weighted, into a single, unified measure. Significant effects upon recall 
were found for all three measures. Further, when significant effects of 
serial position and rated importance were found, these turned out to be 
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largely dr^ to the correlation of those factors with hierarchical importance. 
They also found that if a unit was recalled, then there was nearly a 70% 
chance that the uni"^ which occurred itiimediately above it in the tree was 
also recalled, although, overall, recall was only about 23%. Further^ 
combined hierarchy score was positively correlated with stability of 
recall across two recall trials. The effect of importance has-been 
replicated using more formal text structures (e.g, Klntsch, 1972, 1974 
Meyer, 1975, 1977.) Kintsch has replicated the results of Meyer 6 McConki e 
(1973), using his more formal and objective propositional" description. 
Kintsch 's (Kintsch S Keenan, 1973) propositional ranV is essentially equi- 
valent to Meyer and McConkies' hierarchy depth score, and Kintsch 's counting 
of descendant propositions is analogous to Meyer and McConkies' ^-nits 
beneath score. Essentially similar results have been reported using 
Runelhart's (1975) story grammar as the structural representation (c.f. 
Mandler 8 Johnson, 1977; Rumelhart, 1976; Thorndyke, 1975). 

"The body of research oh this topic is growing. We expect that further 
work along these lines ill follow naturally from our development of 
adequate procedures for encoding textual materials. We turn now to a 
discussion of- our proposed work on this tOpic, 

Descriptive analysis of text content . There are nany contenL variables 
that must be eva.luated and coded in the descriptive analysis of texts. 
Among these are the a^^thor's intentions ar various levels, inferences that 
must be made in comprehending the text, and the structure of the content 
of the text. 

For practical descriptive purposes; rough intuitive typologies will probably 
suffice for description of global aspects of authors' intentions like 
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subject matter, purpose (description, explanation, persuasion, practice, 
entertainment, etc.), and genre (narrative, instructions, diaglogue, etc.)* 

With respect to problems of inference it will be necessary to distin- 
guish among several types and levels. Types include inferences that do 
and do not require substantial background knowledge outside the text, and 
inferences involved in the interpretation of non-literal language. Another 
possible type of inference that needs investigation is the matter of 
"textual postulates". The question here is, are there "postulates" 
peculiar to written texts distinct from the "conversational postulates" 
that play a role in conversational interactions (see Gordon S Lakoff (1971), 
Searle (1975), Morgan (1977) for discussion)? If so, are there different 
"textual postulates" for different kinds of texts? 

Levels of inference range from inferences that must be made about 
the author's global intentions in the text, to proposition-level infer- 
ences involved in the interpretation of particular sentences, or in inferring 
connections between the proposition-level units of the text. 

An important practical goal of research on these questions should be 
the d- opment of replicable procedures and techniques for identifying 
and coding the various kinds of inferences involved in comprehending a text. 
Frederiksen (1975) has already made one such proposal. It would also be 
very useful to develop relative measures of the "explicitness" of text; 
that is, of tne amount of inference that must be made in comprehending the 
text . 

The third matter, the s-^racture of the content of the text, is clearly 
a very important factor of text analysis. The basic idea of text structure 
is not a new one, but F.oes back at ]ea^.t to the work of thu, Russian 
Formalists, Propp (1928) in particular, on narrative structure. There have 

t;. 
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even been occasional attempts to exi"end the apparatus of transformational 
grammer to the analysis of narrative structure, but this work has received 
very little attention in linguistics. Recently, however, the notion of 
text structure has begun to receive attention in psychology and, concomitant 
wi?h the growth of interest in discourse in linguistics; see for example 
Frederiksen (1972, 1976), Grimes (1975), Rumelhart (1975,1976), Schank (1975), 
and van Dijk (1975, 1976^. This work varies in rigcr, explicitness , gen- 
erality, and on many details of analysis. ^But iz is likely that the common 
threads running through this work will lead to useful descriptive tools. 

The common threads are these: (1) a multi-level approach to the 
organization of the content. This includes at least a propositional 
level, consisting roughly of the sentences of the text and proposition-level 
inferences, represented usually by a system reminiscent of a Fillmorean 
case system (see Fillmore, 1968) or "schemata" (see Rumelhart 6 Ortony , 1976, 
for discussion), (2) a higher level of organization of the text, usually 
hierarchical, except perhaps in the case of Schank (1975), where the 
structure consists of "causal links" among the proposition-levol units 
(explicit and inferred) of th , text. The details of this higher level 
differ; Rumelhart (1975), for example, analyzes story structure in terniJ 
of two paired sets of rules: a set of syntactic rules specifying the 
structure of the story in terms of functional categories like "setting", 
"episode", "event", and so on, and paired with each syntactic rule a 
"semantic interpretation rule" specifying the relations ("CAUSE", "ALLOW", 
"MOTIVATE", etc.) among the categories specified in the corresponding 
syntactic rule. Grimes (1975), on the other hand, treats these matters 
separately, as "non-events" and "rhetorical predicates". Some researchers 
(e.g., Meyer, 1975) supply explicit analytic procedures; in others the 



procedure is implicit. Some (e.g., Rumelhart (1975, 1976), Schank (1975), 
vanvDijk (1975), offer attempts at explicit summarization rules. It 
is generatlly at this level of content structure, in terms of relations 
between elements of the content and/or height in che hierarchical structure, 
that matters like importance and cohesion are treated. (3) The role of 
inference is crucial to this approach, both in "filling in the blanks" by 
inference at the prepositional level, and in inferring relations among 
che higher- level units of the text. * 

It should be possible to- develop from this work at least informal 
procedures for anlayzing and describing the content structure of texts. 
The ideal theory of content structure, of course, would be one which provided 
a procedure for analysis which did not rely heavily on the analyst; rather,' - 
the theory would be algorithm that would take as input a text, a context, 
and a model of knowledge of the world, giving as output a representation 
of the organizational and logical structure of the text. Givon the immense 
linguistic and pr>ychological problems that must be solved — ^ 
complete theory of linguistic semantics , and a coherent way of representing 
world knowledge — we do not expect this ^.oal to be reached in the near 
futur"^. In the meantime, existing research ai text structure should l?c > 
mined for analytic techniques by, first", attenjpting to extract from ^^^k 
appro<^ch a procedure for .analysis of text content ; second by beginnSil||^ 
to apply oach resulting procedure to selectod reading texts, from references 
obtained from text sampl ing r^o^arch; and finally, by comparing rer,ulls of 
various approaches and modifying procedures where necessary. Such 
modifications should be motivated by at least four considerations: F.enerality 
repl L-r^abi lity , 1»^V(U ot detail, and r'.'lativo merits. (a) Hcnerality: doer. 
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the procedure yield useful and perspicuous analyses of a wide vaiii^ty 
of texts? Rumelhart (1975), for example, seemfs appropriate only for 
stories; others (e.g., Grimes» 1975» Frederiksen5(1975) are more general. 
But it remains to be seen whether there is a "universal grammar" for 
texts. The most general scheme may fail to give insight into important 
differences between texts. XL" is conceivable that what is more useful 
is a repertoire of analytic schemata for various types of texts as, for 
example, proposed by Rumelhart (197^). (b) Replicability : does the pro- 
cedure produce essentially similar results when used by different analysts? 
There is some evidence of replicability in the literature (e,g., Meyer, 

1975), but the matter deserves more study. ^ (c) Level of detail: does 
the theory yield procedures with a good level of detail in analysis , or 
only very general aspects of structure? (d) Relative merits of the 
resulting analyses: are ^differences substantive, or merely terminological? 
Are there simple translation algorithms from one procedure to another, showing 
that they are equivalent? 

This work should not be done in vacuo , of course, but should be informed 
by theoretical research on comprehension and the representation of knowledge. 
The result of the research outlined here should yield: (a) ^ Dest analytic 
technique (or repertoire of J:echniques) for analysis of text content, 
(b) paper discussing the relative theoretical and practical merits of 
the various theories and procedures, (c) an initial set of contefft-analysed 
texts, for use in research on the relation between content structure ana 
linguistic structure of the text, (d) work in this area, together with 
research on the relation between text structure and linguistic structure. 
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should lead eventually to procedures and measures for evaluating the 
"transparency" of text structure; that is, the degree to which the 
intended^ structure of the text is easy or difficult for the reader to' 
discern, and for evaluating the relative complexity of text structure. 
Ot her Extra-linguistic Variables 

' • As we have emphasized throughout, aspects of the text which may appear • 
as important determiners of difficulty on one level of analysis can always 
be over- ridden by variables' at a higher level of analysis. In t)he last 
analysis', the * difficulty of a piece of textual material does not lie solely 
or perhaps not even primarily in the text. Rather, it is an interaction 
between the textual material, and the skills, knowledge, motivation, and 
expectations of the reader. On this view, understanding is really the ^ 
construction of an interpretation of a text consistent with the clues provided 
by an author. Those clues may be more or less difficult for any^given reader 
to interpret. This difficulty can depend on numerous variables in addition 
to the actual sequence of symbols normally taken to constitute the text. 
In this section, we turn our attention to a few of these variables. ^ 

The role of illustrations . No one can doubt that illustrations play an 
important role in comprehension. In early reading materials illustrations 
probably carry the bulk of the content of *the passage being read. However, • 
illustrations are far from a semantic crutch for the illiterate. They often 
play a critical role in the comprehension of ter.tual material. There is 
lirtde information available pinpointing the precise role such information 
plays in comprehension. There are, however, several studies which demonstrate 
that i^ can play an Important role. For example, Matz 6 Rohwer (1971) 
reported that the addition of redundant pictures increased the comprehensability 
of a passage presented aurally to fourth graders. Fhe increase was especially 
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significant for low-SES black students. In another study, Arnold S 
Brooks (1976) have demonstrated large effects on recall and ability to 
generate correct inferences when a text was accompanied by pictorial 
organizing — rather than verbal organizing- -material. In another very nice 
demonstration Bransford S Johnson (1972) showed how pictorial information 
can convert a totally senseless passa^e'^into a very meaningful one. To 
date,' most of these^exp^rments have been mere dem^^nst rat ions rather than 
contributions to a detailed account of the role of pictures in textual 



structure or in comprehension. 

One v/ay of remedying this lack of information would be co examine the 
role of illustrations in the comprehension of friaturally occurring) written 
material. A .first step would involve locating texts whore explanations 
are accompanied by pictures, charts, or graphs. One could then examine 
the shift in types of pictures found in texts used by elementary grade 
children (K thru 8). More specifically, the following questions would be 
addressed: (1) how much informaton is represented in tie pictures (i.e., 
how well can one understand the author's intent simply by looking at the 
pictures); (2) how much information in the pictures (graphs, charts) 
is dependent upon the text for clarification; (3) how much information in 
the text is dependent upon pictures, graphs, or charts t^o becoite comprehencible ; 
(4) what proportion of instances occur where the text is redundant and/or 
where the pictures are redundant? .It seems reasonable to expect that 
younger children will be exposed to more cases in which text depends on pictures 

Q 

than cases in which pictures depend on the text. Also, the proportion of 
redundant text-picture situations should decrease- with increasing grade 
level. 
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Following an analysis of what children naturally experience in terms 
of the relationship between linguistic and ron-linguistic information, 
experimental manipulations can be made which vary written oresentations 
along the four dimensions implied above and comprehens ' ->n could then be mea- 
sured. A comparison of particular interest would be - between self- 
contained, non-illustrated text, tex\ with entirely redundant illustrations, 
and illustrated text where neither text nor pictures carr^'es'the whole 
message (e.g., comics). It seems reasonable to expect that x^^S poor 
readers, who have not yet learned to extract the full meaning from "decontext- 
ualized text," will be adversely affected by conditions where the intended 
message is not cairied fully by accompanying illustrations. 

There is also evide .:e that illustrations play a role in providing 
clues as to the structure of the content as well. In a preliminary analysis 
in whic Rumelhart's (1975) story grammar was applied to several children's 
stories It was found that th^ illustrators of those stories had elected 
to provide one picture for each EPISODE (in tl:ie sense of the grammar) of 
the story. This suggests that we should look closely at those places in, the 
text illustrators choose to place their pictures and attempt to relate these 
places to ignif^'cant units of the story as definea by the various models 
of t xtual structure discussed in the previous section. 

The role of interest . Textual d'fficulty depends not only on the marks 
written on the pa^e, but also on the attitudes a reader brings to the task 
of making sense of them^ As a rule, oral language is used very purpose, fully . 
We speaV ^^o others and attempt to understand wl at they say not because we 
are instructed to do so, but because we' intend our speech acts to accomplish 
cei'tain results. In short; we speak and listen as communicative acts. 
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We read because we want ^ i ow about what is written. We write so as to 
express certain of ou> thoughts. For the child learning to read, reduiiig 
is often not a communicative act. Rather, children often read without any 
particular concern for the mes -ge under discussion, but because they must 
read to fulfill certain school requirements. If, children, iike. adults, 
could approach reading as a communitive act — a m^^ans of finding out what 
s/he wants to know, attitudes toward the reading process might well change 
and the child may well be induced to mak^ ''the effort after meaning" 
required for comprehension to occur. 

Perhaps the most obvious content variable which has been manipulated 
in this regard Is the child *s intei'est in the topic of the textual material. 
Presur^ably, if the a* thor is trying to communicate information about which 
the reader has a strong interest, the reader will more readily seek that 
information. Thus, considerable effort has been devoted to the assessment 
of c ildren*s Lntf^ rests in the contgnt of their texts. 

What is surprising is that this line of inquiry has not led to much 
research or the way in which children's interests influence comprehension. 
Blom, Waite, and Zimet (1070)5 who have done the i extensive content 
analysis of children's reading primers, have commented "What is needed 
is an invest '.r;at ion into hov; content actually affects children's attitudes 
and their acquisition of reading ^kill" (p, 219). 

MucF) of the literature a *Lme4 at determining wnether interesting material 
facilitates ciiiJdron'?. reading cornprehension contains sf "'ious methodological 
probler ^. Firc-t , tnere has heon a tendency to measure children'^ interest 
in a topi'" af^er tr.'"^ ' have read ? ^^ge on that top^'c (Bernstein, 1G55; 

Shnayer 19C7). Tnii. :roo^dur'^' conto^nJs the readin";, cornprehenr, ion meaiure 
with th-j interest ai^s^ "* m'^nt procoaurV- since childr^'^n nay report morr" intt^rest 
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a passage that they have just understood. A better procedure would be 
to assess children's interest independently of any reading material and 
independently of the reading task. If superior performance is associated 
with high-interest material, then the rival interpretation that children 
prefer material t^^t they do well on can be eliminated^ 

A second approach to the interest assessment problem has peen to 
assign material to children based on normative data on children's interests 
(Dorsel, 1975; Klein, 1969; Stejichf ield, 1967). While it is the case that 
children's interests are highly sex-typed (see, Asher, 1975; Tyler, 196H\ 
research in the area also indicates a fair amg^^nt of variability of interest 
in a topic among males or females.. The approach thdt should be adopted 
would be to individually assess children's interests and assign passages 
which are individually upropi^ate. 

A third problem in the literature of children's interests and reading. 
coTHDrehension is the type of reading comprehension measure employed. Studie 
have used reading achi //ement tests specifically developed for each study 
with no prior demons tra^. ion of tesx reliability or validity. In .many cases, 
item selection appears to have been arbitrary. 

Finally, most of .he research in this area has not sampled from a wide 
array of reading topics. Frequently, two passages are used, one of which 
is supposedly hign-interest and the other supposedly low- interest (e,g, , 
Klein, 1969). 

A recent study by Asher & MarkelT^ 197U ) provided an improved method- 
ology for studying the effect of reading content on children's reading 
comprehension, ^hildren^s interests were assessed ^ independently of any 

reading materi i ry having "^hem rate photogr^a^hs on an interest scale. 

"I 
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The pnotographs represented a wide array of topics. One week later, each 
child received from a s^^cond experimenter an individualized set of six 
passages. Tnr'tit^ pd^-ag^js cor'i'esponded to the child's three highest 
rated topics and three corresponded to the child's three lowest rated 
topics. The cloze px-^ocedure (Taylor, 1953) was used as a measure of reading 
comprehension since che procedure provides a replicable and objective method 
for generating test items, and since cloze test results correlate highly 
with results from standardized achievement tests (Bormuth, 1^67; Rankin & 
Culhar.e, l^GI). 

Th:^ methodology has bee^ employed in several experiments to date. 
The flrs^- (Asher & Markeli, 197^+) was conducted to assess the extent to which 
sex differences in reading comprehension raight be a function of the interest 
of the reading mat^?i,al. Elementary school boys often are found to read 
more pooir/ than girls on standardized tests (Asher 8 Gottman, 1973; 
'3ates, 1961). In addition, boys are much more li'^el> to be found in rt^medial 
reading clashes (Blom, 1971). -4 

Fc,:ultw of the initial experiment were striking Boys.^ reading compre- 
nen.Lon wd' considerably higher on high- than low-interest material. Girls 
did lightly better on high- than low- interest material, but the difference 
for ^'^,irl^. wa . not ota c is"_ i cai Iv significant. As for sex differences, the 
data indicated that girls scored significantly higher than boys on the 
low- inter'^s t la.^sages, r^ut on the h igh- interest material, the sex difference 
wa-. e 1 :nlr it--'i. The^^e dala provide a powerful demonstration of the effect 
of fr\^^ in^-r*-;"'' i\ na'erlal on reading cornr-rehension . The oft-clte.i 

^x dl f f '--^r'.n-^ Iii rv/aJIri7, ^omr rehcnsion appears to be at least nartly 
a funo' ^on -^f ^r.^ na"^ ;re t-f tht- m^t.^rial children are giv^-n to road. 
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The final experiment in this series (Asher, 1976) also found that girls 
as well as boys benefited from being given high-interest material. The 
ro^jor purpose uf this cxperxmcnt was to assess th6 sffect of intGrcst on 
black and ^hite children's comprehension, but sex was also included in the 
analysis. The results, with regard to sex (sae below for a discussion of 
race and interests) indicated that both boys and girls achieved higher cloze 
scores on the high- rather than low-interest passages. 

What accounts f^.^ the discrepancy between the initial experiment and 

... 

the siibsequent ones? Why did girls do better on high-interest material 
only in the latter two experiments but not the first? One possibility is 
that children in the initial experiTT^ent were developmentally different 
from those in subsequent experiments. School-administered reading achieve- 
ment data were available for each sample. All three experiments were 
conducted in the same school district so data from the same achievement 
test were available (Educational Development Series of the Scholastic 
Testing bt^rvLce). An interesting pattern emerges. In the first experiment, * 
girls outperformed boys on the standardized tests. In the subsequent experi- 
ment no sex difference was found. ^ 

Fifth gradt' is a transitional period witn respect to sex differences 
in reading coTij.rGhens ion. It appears that sex dil.erences are rather consis- 
tently obtained with younger children, typically not founa with older children, 
and inconsipt ^ntly obtained in fifth grade (Gates, 1961; Hughes, 1953; Stroud 
Lindquest, 1^^2). It m^^y be that tho effect of interest on comprehension 
interacts wlt'i th-^ developmental l^vel of children. In the earlier elementary 
years when ?.irl.. are outperforming boyr in reading they may be strongly 
oriented toward doing ^lell in :.chool. The content of reading material may 
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have less effect because they are seeking to do well regardless of the type 
of material they are asked to read. Later a shift may occur. Girls may 
cease seeking ar strongly to achieve for its gvti sake or fur eAlei-nal 
approval, and thus, become more responsive like the boys to the type of 
reading material they are asked to read. 

Vfnat is suggested, then, is that when elementary school boys and girls 
perform equally, it may be because girls have become less motivated to excel, 
regardless of the nature of the task they are given. Thus, in the samples 
when boys and girls performed equally on the school-a^'^-inistered stand'^rdized 
test, both sexes ^"ere fac?l"itated by interesting material. In the sample 
where girls achieved higher standardized test scores, boys but not girls 
were aff -ted by high- interest material. This interpretation is highly 
speculative but testable. One approach would be to study the effect of 
interest developmentally . Children below fifth grade (e.g., thirr" grade) 
should show stronger effects for boys than for girls. Older children (e.g., 
sixth graue) c:hould show effects for both sexes. Another approach would ^--^ 
be to sub-classif^y fifth grade children based on achievement scores or 
relevant motivational measures and determine whether certain groups ^^'^ 
children show the effect of interest and others do not. v 

Finaiiy, turn to a discussion of sorae unansw^^red questions. It 
can be concluded from the available data that children comprehend moi-^e of 
high- than low-intf^rest material. It has not been established, however, 
v;hv th's effect occur^ . The pt^ecedlng discussion about sex dif ferei' "js 
implies one t/p^ of exr J ana^ ion ; namely, that children compreh'^nd more 
of intf^re.^ting rnaiMi'iai r^ecau^'^' they ivn more mot i vat ^'d to ongagt:^ In the 
tac^ . A -/n^r M-:r. ell (197U), in attempting to explain their initial 
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findings, invoked one type cf motivational account. Citing data that 
boys and girls view reading as a feminine activity (Kagan, 195U; Stein & 
Smithcllc, 1950), Ashcr and Markell suggested that high-iajLtu'eiiL reading 
material, through its association with trac'i tionally masculine topics 
(e.g., basketball, race cars), might serve to make reading a more sex- 
appropriate activity. This ex^"' ''nation has the virtue of explaining both 
the fact that boys were facilitated by high-interest material and that girls 
were not. Girls, after all, would not need high- interest material to 
define reading as sex-appropriate. The problem with the explanation is 
that it doesn't handle the finding from later experiments that girls, too, 
perfoitned better on high- interest material. A more general motivational 
account is required. 

There i^ also the possibility that the interest effect is not a motiva- 
tional phenomenon at all. Children may try just as hard on low- interest 
as on high-interest material, but due to knowledge constraints do less 
well on the low- interest material. ^ Two types of knowledge variables can 
be suggested. First, children may have less familiarity with the vocabulary 
of low- interest topics. Hence, when attempting lo replace deleted words 
on the cloze task they may have greater difficulty when the material is less 
famili.=ir. A second possibility is that children have more elaborate and 
dif ff^rent iated cognitive structures with respect to topics they are 
interes*:od in. Inler^ebt in a topic implies that the reader has schemata about 
that to^ic which help him organize .material, infer the writer's intentions, 
anticipate future discussion in the text, etc. The greater availability of 
such ".chemata would certainly facilitate cloze performance on high- interest 
pas^age^ . 
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In the : ^al world of reading, knowledge and motivation are confounded. 
Attempts to stimulate interests involve imparting new information as well 
as motivating a student to explore a new area. New information, or old 
information presented in a new light, does much to spark interests. Although 
motivational and knowledge factors are confounded in everyday life, it would 
be instructive for theoretical, and perhaps for practical reasons as well, 
to be able to evaluate the relative contributions of motivational and 
knowledge variables. Most teachers probably assume a motivational explana- 
tion of interest effects. If, however, the effect of in teres t^is^not so 
much due to greater effort expended but to greater knowledge., then it would 
imply that a major barrier to reading comprehension is not the desire to 
read but a student's limited knowledge of the world. 

Some preliminary attempts to separate motivational and knowledge explan- 
ations have been made. Asher (1975) created a set of passages, each of 
which could be made to be about a variety of topics while in fact containing 
the same vocabulary. Each child received three topics associated with his 
or her highest rated pictures and three associated with his or her lowest 
rated pictures. The assignipent of topics to passages was random. 

The. results indicated no difference between high- interest and low- 
interest performance on these controlled- vocabulary passages. This would 
tend to support the view that some type o"^ knowledge variable accounts 
for interest effects when they occur. However, the post-reading rating 
children made of how much they would like tc read more about each passage 
suggest that the interest minipulation was not very strong. Although children 
rated the passages associated with high-interest topics significantly higher, 
the actual difference in ratings vas quite small. It seems that in 
controlling the vocabular/, much of the richness associated with a topic 
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was lost. The cat paragraph, for example, doesn't seem to have very much 
to do with cats. Because the manipulation of interest was weak in this 
study, the comprehension data cannot be confidently interpreted. The con- 
trolled vocabulary approach does not seem to be an effective way of testing 
the motivation versus knowledge explanations. 

Two other types of strategies might be attempted in future studies. 
One approach would be "^o provide strong external incentive for doing well. 
If poorer performance on low- interest material results from low motivation, 
then it should be possible to eqiialize performance on high- and low- interest 
material by providing additional incentive to do well. If, however, there 
are vocabulary or knowledge constraints on low- interest materials, then the 
additional incentive to try hard should not be that effective in reducing 
the gap between high- and low-interest performance. 

Another strategy would be to pre-assess children's knowledge with 
res7ect to certain passages. Perhaps wh questions (Bormuth, 1973; Finn, 1973) 
could be used to assess children's knowledge about specific passages 
prior to having them read the passages. Perhaps, given enough passages, 
there would be a sub- set of two or three for each child that would fall 
into each of the following four cells: high-interest, high knowledge; 
high-- Interest, low-knowledge ; low-interest, high-knowlfedge ; and low- interest , 
low- knowledge. This type of experiment could assess the strength of the 
interest effect, knowledge effect, and the interaction of these two variables. 

Another issue that might be explored is whether high- interest material 
decreases race differences in reading comprehension. The typical findings 
in previous r€»search is that the gap in reading achievement test performance 
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, widens with each passing year in school (Coleman et al., 1966; Singer, Gerard, 
"^and Redfearn^, 197S). The interest ingness of the material might well have 
a role to play either because black children are le^s motivated to read or 
because they have less amiliarity or knowledge about many of the topics 
they encounter in school. 

One interest study to date tested fifth grade black children and 
white children (Asher, 1976). Each child's interests were first assessed 
using ^he picture rating technique and two weeks later all children received 
a *: jt of six passages from the "irittanica Junior Encyclopedia. Three passages 
corresponded to the child's higlily rated topics and three to lowly rated 
topics. Results indicated that both black children and white children were 
facilitated by the high-interest material. The strong effect of topic interest 
for black children is promising in light of the nature of the reading 
material. It is difficult to conceive of a better representative of standard 
dialect material than the Encyclopedia Brittanica. The strong effect of 
interest contrasts with the modest effect that dialect variation seems to 
have on black children's reading comprehension (Hall and Turner, 1974; Nolen, 
1972). 

It should be noted, however, that interest and race did not interact 
in the Asher study. That is, the gap between blac) white children's 

performance remained the same on high- and low-interest material. One 
possibility is that somewhat easier material would produce different findings. 
Although the Encyclopedia is said to be for third through sixth grade 
children (Walsh, 1973), readability analysis using the Dale-Chall formula 
indicates thai; the passages are at a seventh grade level (Asher and Markell, 1974). 
Perhaps if more grade appropriate material had been used, an interaction of 
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race with interest might have been obtained.* It remains to be seen whether 
high-interest material can reduce the gap between Llar.k and white children's 
reading performance . 

One interesting secondary finding from the Asher (1976) study was the 
pattern of interest ratings for black and white children. There was considerable 
similarity of interests across race. Johns (1973) also has recently presented 
data suggesting that black and white children have similar reading preferences. 
These data suggest that the movement in the 1960 's and 1970's to write text- 
books with story content for black children may have overestimated the distinctness 
of black children's interests. Once again, it seems that the assignment of 
material should be based on individualized assessment of interests. 

Another issue deserving attention is whether achievement tests contain 
considerable amounts of low- interest material. A recent analysis of the 
characters or actors in achievement test passages indicates that a disproportionate 
number of characters are male (Faggen-Steckler , McCarthy, and Titcle, 197U). 
However, as Dwyer (l97U) has noted, there is a distinction between sex-role 
bias and content effects on performance. Story content is probably a more 
important determinant of performance than sex of the story characters. 
Research could be done to learn wh».ther children's performance on achievement 
tests passages is more similar to their performance on low- than high-interest 
material. If achievement test passages are functionally of low-interest 
then many children *s "ability to derive information from reading may be 
under representated by existing tests. 
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Summary 

Reading is the proctibb uf imputing an Interpretation on a piece of 
textual material* The ability to carry out this task efficie.itly and reliably 
involves the knowledge of and sensitivity to " 3 variety of linguistic con- 
ventions adhered to (implicitly or explicitly) by an author. Difficulty in 
comprehension can arise whenever the author's assumption about the knowledge, 
skills, processing capacity, and expectations are unfulfilled. Thus, we see 
every reading difficulty as a kind of communication failure. We see (at 
least one of) the goals of reading education to be the collection of skills 

/ 

and knowledge a child brings into the classroom v/ith those additional ones 
required to meet the assumptions of most authors and the limitations of the 
written mode of communication. In this report we have proposed to devtilop 
methods of isolating and making as explicit as possible those assuiriptions 
(and their textual consequences) which appear to cause young childrer^ the 
most difficulty. We suspect that such an analysis can serve at least three 
'purposes: (l) we can inform authors of children's ^exts about the assump- 
"tions that they (the authors) are tacitly) making; (2) we can inform teachers 
of the kinds of assumptions which are typically unfulfilled so that an edu- 
ca^tional program might be developed in an attempt to supplement a child's 
kncwled?,e or skills and (3) the patterns of difficulties will serve as a 
foundation for a more detailed theory of linguistic information processing. 
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